A Generic Framework for Diamond Lemmas 

Lars Hellstrom* 



Abstract 

This paper gives a generic form of the diamond lemma, which in- 
cludes support for additive and topological structures of the base set, 
and which does not require any further structure (e.g. an associative 
multiplication operation) to be present. This result is intended to be 
used as the core of diamond lemmas for particular algebraic structures, 
taking care of all the common technicalities. With this generic diamond 
lemma, the main steps needed to prove a specialised diamond lemma 
is to define the reduction maps and analyse the structure of critical 
ambiguities. 

The abstract machinery is backed up with concrete suggestions for 
how one should set things up in order to reproduce traditional results 
in the general setting. Several instances of the fundamental theorem 
of Grobner basis theory are derived as corollaries of the main result. 



1 Introduction 

The Diamond Lemma for Ring Theory [2j of Bergman is an important the- 
orem that links together several branches of mathematics. On one hand it 
is the bridge between associative algebra and mathematical logic that can 
make the definition of an algebra through generators and relations effective. 
On another it marks a middle ground between the theory of Grobner bases 
and the theory of term rewriting, which can be seen as belonging to either 
of the two. Yet it is only one member in a family of results on similar con- 
nections, which can be quite different in their technical details even though 
the essential ideas are mostly the same. Furthermore many of these results 
exist in the literature only as sketches (which, it seems, everybody is waiting 
for someone else to flesh out, as it is all so "obvious" anyway), and as a re- 
sult the rigor in many arguments becomes somewhat lacking, as they should 
rightly have been proofs making use of some particular diamond lemma. 

My intention here is to state and prove a generic form of the diamond 
lemma from which one can easily derive more specialised results suitable 
for particular problems. It is probably not the most generic form that is 
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possible, but it can deal with the technicalities in all cases I know of, and 
does so without making extensive assumptions about the structure to which 
it is applied. 

From a strictly technical perspective, the theorem given here generalises 
that of Bergman in three directions: 

1. A topological aspect is added to the basic machinery. This makes it 
possible to treat e.g. formal power series problems within the diamond 
lemma framework. 

2. The assumptions about a multiplicative structure have been dropped 
from the core theorem. Auxiliary theorems are provided which to- 
gether with the core theorem cover what the associative algebra dia- 
mond lemma can do, but also apply for a much broader range of alge- 
braic structures (nonassociative algebras, operads, PROPs |10j . etc.). 

3. The definition of reductions has been separated from the diamond 
lemma, so that it no longer depends on these having a particular form 
or that all reductions of a particular form are active. The latter is 
useful if one wishes to cover Shirshov's theory of bases for Lie alge- 
bras il4l . 




The first generalisation was the subject of my Ph.D. thesis [7], but the pre- 
sentation here has been refined in that it eliminates many minor assumptions 
on how the multiplicative and topological structures interact. Readers who 
want concrete examples may however prefer the thesis presentation, as it 
treats some applications in great detail. 

The main advantage of the topological aspect is that it enables one to 
handle both polynomials and power series (or their respective counterparts 
from less traditional algebraic structures) using the same machinery. A less 
apparent advantage is that problems that can be posed entirely in terms of 
finite sums (i.e., polynomials) sometimes have solutions where the normal 
form is an infinite sum (power series), and in this case one has to employ 
the topologized version in order to prove things about this normal form. In 
so doing, one can take advantage of certain relaxations of the conditions 
of the classical result; Definition 15.11 of the descending chain condition and 
Definition 15.91 of ambiguity resolution both admit more than in Bergman's 
diamond lemma. 

The second generalisation has been the main direction in my subsequent 
work, initiated in response to a question from Loday on whether there is a 
diamond lemma for operads. It's not too hard to see that there is such a 
creature — more work had to be spent sorting out the details of claims than 
the details of their proofs — but one fundamental change when going from 
algebras to operads is that one goes from a single-sorted algebraic structure 
(there is one set of elements) to a multiple-sorted algebraic structure (in 
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an operad, elements of different arities don't mix, and hence there is a 
separate sort of element for each arity). The interactions between these 
elements of different sorts is certainly a kind of multiplicative structure, but 
one that syntactically is much more unwieldy (regardless of whether one 
prefers to phrase it using the structure map formalism or the ith composition 
formalism) than that of a ring, and bundling these interactions with the 
diamond lemma would turn an already very technical result into something 
even worse. Furthermore the generalisations do not stop at operads. There 
are good reasons to at least go on to PROPs (because each operad is a part 
of some PROP, and PROPs have a more concise set of axioms), and after 
that there are more general diagrammatic structures that one may wish to 
consider. Handling them all in one result does not seem a likely achievement. 

What turns out to work is instead to separate the parts of the classical 
diamond lemma that deal with the multiplicative structure from the parts 
that ignore this structure. The core of the diamond lemma (Theorem 15. IH 
with the familiar equivalence of four different conditions) can be very neatly 
captured as a result on one sort (hence ignoring the multiplicative structure 
under which sorts may interact), whereas the construction of reductions 
and verifications that some ambiguities are trivially resolvable fall into the 
other part. This is actually rather fortunate, because the first part will then 
deal with the classical technicalities, whereas the second will deal with the 
particular features of rings, operads, PROPs, or whatever; there is an almost 
complete separation of responsibilities. 

The third generalisation is thus in part a natural consequence of the 
second, but there are also other advantages to it. One is that many defining 
identities of the classical nonassociative algebras do not fit well to make "un- 
conditional" rules from; a simple example is the anticommutativity identity 
of a Lie algebra, which if expressed as a rule — > —[y,x\ for all x and 
y' would lead to the infinite rewrite cycle — > — — (— = 

[x, y] ^ — [y, x] — > • ■ ■. One way to handle that in practice is to instead make 
a conditional rule ^[x,y\ —[y,x] if x > y' out of it, and the machinery 
constructed here can handle that; since each pair (x, y) of factors gives rise 
to a separate reduction map, it is merely a matter of considering only those 
pairs for which x > y in some suitable ordering of the factors. While there is 
a certain price to pay in that ambiguity resolution becomes less automatic, 
this price effectively only comes into play when the conditional rules are 
involved. 

The structure of this paper is as follows. Sections [2] and [3] introduce 
the framework within which the core diamond lemma (Theorem 15. lip is 
formulated. These sections also contain plenty of minor constructions for 
setting up various aspects of this framework, to illustrate features of the 
formalism used, and to aid the reader in applying the results. 

Sections H] and [5] contain the bulk of the proof of the diamond lemma. 
The former section is about more abstract algebraic-topological properties of 
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monoids of maps, whereas the latter introduces an order and uses induction 
to link these properties to conditions that can be verified through explicit 
calculations. Notable lemmas are 14.101 (linking normal form uniqueness to 
univocality of the pointwise limit of reductions), 15.51 (existence of normal 
forms), and 15.81 (uniqueness of normal form given relative resolvability of 
ambiguities). Besides the main Theorem 15. 11^ there is also Theorem 15.61 
which provides a characterisation of irreducible elements. 

Section [6] is about ambiguities (a.k.a. critical pairs or overlaps) and how 
one in a multisorted situation can discard non-critical ambiguities from con- 
sideration. This is as much about defining 'critical ambiguity' — a subject 
which spans definitions 16.11 [6^ and 16.81 — as it is about proving them dis- 
cardable. The claim that checking the critical ambiguities is as good as 
checking all ambiguities can be found in Theorem l6.91 Example 16.101 derives 
Bergman's diamond lemma from the generic theory. Theorem 16. 121 is aimed 
more at completion calculations; it justifies dropping unnecessary rules while 
in the middle of completing a rewriting system. 

Section [7] collects a construction and some technical lemmas that may be 
used in applications to demonstrate that the most common setting (a collec- 
tion of free modules) leads to a framework suitable for the generic diamond 
lemma. Again the aim is to bridge the gap between concrete conditions that 
are easy to verify and more abstract conditions used in the generic theory. 

The final Section [8] is about Grobner bases, where Theorem 18.41 ex- 
tends the big equivalence in the generic diamond lemma with some GB-style 
claims. Several instances of "the fundamental theorem on Grobner bases" 
(in commutative, associative, and nonassociative polynomial algebras) are 
derived as corollaries of this theorem, and the theory is shown to also cover 
the case of path algebras. 

A more practical application of the generic diamond lemma theory can 
be found in [Sj. Unlike the applications in Section [HI this exercises the 
multisorted aspects of the framework. 

Notation 

The set N of natural numbers is considered to include 0. Z"*" is the set of 
positive integers and is the set (sometimes the multiplicative group) of 
positive real numbers. The shorthand f{A) for | /(a) | a S A | is frequently 
applied. 

Formal variables are typically written using a sans-serif font: a, b, c, etc. 
When X is a set of such letters, X* denotes the free monoid on X, i.e., the 
set of all finite strings of elements from X. The identity element in X* is 
denoted 1. 

On the matter of monomials versus terms, a monomial is considered to 
not include a coefficient, whereas a term generally contains a coefficient. 
The relation symbol = denotes congruence rather than identity. 
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2 Basics 



For the machinery employed here, it is convenient to fix a framework with 
five pieces of data: 

• An abehan group (written additively). This will play the role of 
set of all finite expressions. 

• A set i? of maps Ai — > Ai. This can be used to encode a module 
structure on ^A. 

• A subset y A4. This will play the role of set of monomials. 

• A family O = {-B^}^! of subsets of Ai. This will become the funda- 
mental system of neighbourhoods of € 7W and is thus defining the 
topology. 

• A family Ti{S) of maps A4 — Ai, where 3 is the set of all 
expressions. These are what in the end specify the wanted congruence 
on M. 

When applying the diamond lemma to a multiple-sorted structure, there 
will be one such quintuplet [A4, R, y, O, Ti{S)) for each sort, but since the 
core diamond lemma itself is applied separately for each sort, one does not 
have to take this multiplicity into account when proving it. Notation for 
and interactions between different framework quintuplets for a structure are 
considered in Section EJ 

In the main theorem there will also be: 

• a partial order P on 3^; 

but that can without too much difficulty be separated from the rest of the 
machinery, so it will instead be introduced explicitly whenever it is needed. 
Having it separate is sometimes convenient, as one in complicated arguments 
might want to make use of several different orders. Finally, there is in several 
supporting results: 

• a family V of maps M — > A4, which can be used to enforce compat- 
ibility with a multiplicative structure; 

but the typical use of that item is rather on the level of constructing Ti (5*) 
or proving things about it. 

The choices of R, y, O, and Ti{S) are subject to a couple of additional 
conditions, which are specified as assumptions below. Technically it would 
be possible to instead include them as additional conditions in all theorems 
and lemmas that depend on them, but it is more convenient to throughout 
the presentation assume them to be satisfied. There are plenty of suggestions 
for how one may choose the framework data to ensure that the assumptions 
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are met. Ti{S) is treated in the next section, but assumptions on R, y, and 
O are given here. 

Assumption 1. Every element of R is a group endomorphism of M. 

Definition 2.1. A subgroup C is said to be an i?-module if r(a) G N 
for all a € A and r ^ R. 

If Ai has an 7^-module structure for some ring 7?., then it is natural to 
choose as R the set of maps a ^ ra : M. — > M. for all r € 7^; in this case 
the above i?-module concept coincides with the standard 7^-module concept 
and all is as one expects it to be. It may however in some cases be necessary 
to impose a restriction on the elements of TZ which may contribute to R, 
and in that case it is weaker to be an i?-module than to be an 7^-module. 
It is also perfectly possible to take = if no particular module structure 
is available. 

Assumption 2. // A C At is an R-module such that y ^ N then N = M.. 

In other words, 3^ spans M.. The traditional approach is to begin with 
y just being some set, pick some ring TZ, and then construct Ai as the free 
7?.-module with basis 3^; if in particular y is the set X* of words on the 
alphabet X then this will make M equal to the free 7?.-algebra TZ{X). An 
alternative approach for the free algebra TZ{X) is however to let 3^ be the 
set of all terms — products rfi of a scalar r £ TZ and a monomial /j, £ X* — 
as this makes it possible to take R = 0. It is also possible to interpolate 
between these two extremes, or pick a set y with more complicated linear 
dependencies between elements, although the latter is likely to make it more 
complicated to construct Ti{S). 

Definition 2.2. Let R* denote the set of all finite compositions of elements 
of R; in particular, R* is considered to contain the identity map id: A( — > 
M. Let ±R* denote the set {r,-r \ r £ R* }. Let R*y denote the set 
{r(/i) |r G R*,ney} <^M. 

Lemma 2.3. Every element a G Al can be expressed as 



for some n G N, ri, . . . , r„ G ±R* , and //i, . . . , /i„ G 3^. 

Proof. The set of elements on the form (|2.ip constitutes an i?-module that 
contains 3^. Hence by Assumption [2] the set of such elements is the whole of 



Assumption 3. O = {Bn}^^^ is a family of R-modules such that Bn 5 
Bn+i for all n G Z"*" and HJ^i = {0}. 



n 




(2.1) 



fc=i 



M. 



□ 
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Definition 2.4. A set N C M is said to be open (in A4) if there for every 
a (z N exists some e € O such that 

N^{a + b\bee}. (2.2) 

To put it differently: The topology on A4 is the group topology for which 
O is a fundamental system of neighbourhoods of 0. 

Many arguments involving topology in subsequent sections will be ex- 
pressed using e-5-formalism, but since e and 6 will be neighbourhoods of 
rather than the conventional positive real numbers, a few examples of 
what this formalism looks like may be in order. First and foremost, the 
e -neighbourhood of an element o is the set a-|-e:={a-|-6|6Ge}. A map 
/ is continuous at if there for every e € O exists some 5 (z O such that 
f{5) := { /(a) I a G (5 } C e. It should furthermore be observed that continu- 
ity at 0, for group homomorphisms, is equivalent to continuity everywhere 
(and even to uniform continuity everywhere). That 5 is smaller than (or 
equal to) e is of course expressed as J C e, and the minimum of e and 6 is 
end. 

In addition to these general properties of neighbourhood arithmetic, 
there are also some special properties following from Assumption [3] that 
are of great importance here. Firstly e + e = e for every e G O since e is a 
group. Similarly e — e = e and r(e) C e for all r ^ R. Finally the inclusion 
of Bm in Bn whenever m> n implies that e + 6 = eU 6 for all e, (5 G C 

The choice of O is a rather extensive topic, with many different ap- 
proaches that should be mentioned, so it seems best to leave that for the 
end of this section, and instead proceed with the things that can be done as 
soon as O is in place. 

Lemma 2.5. The group operations on A4 and all elements in R are con- 
tinuous. 

Proof. Let s: A4 x A4 — > Ai : {a,b) i— >■ a — 6 be subtraction as a map; 
proving it continuous implies the same for the standard group operations 
addition and negation. Let C 7W be an arbitrary open set and let (a, b) G 
s~^{N) be arbitrary too. Since a — b £ N there exists some e G O such that 
{a-b)+e C A^. For this e, {a+e)-{b+e) = {a-b) + {e-e) = {a-b)+e C A^, 
and hence {a-\-e) x {b-\-e) C s~^{N), which means (a, b) is an interior point 
of s~^{N). It follows that s~^ maps open sets to open sets, and hence s is 
continuous. 

Now let r G i? and an open set A^ C A^ be arbitrary. For every a G 
r^^(N) there exists some e € O such that r(a) + e C A", and hence a + 
e C r~^{N) because r(a -|- e) = r(a) -|- r{s) C r{a) -t e C N. Thus r is 
continuous. □ 
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The next step is to go from Ai to its completion M, which can be 
constructed in the standard way as the set of equivalence classes of Cauchy 
sequences in Ai, where two sequences {a„}J^]^ and {bn}'^^^ are considered 
equivalent if lim„^cx3(an — ^n) = 0. The topology in the completion can be 
defined in terms of limits: a = lim„^oo for o„ = if and only if 

{bn,n}^=i is a Cauchy sequence in M and a = [{bn,n}'^=i] ■ The equivalence 
classes of the constant sequences provide the canonical embedding of A4 into 
its completion, and it is convenient to identify this with the original A4. 

An alternative approach, which fits better in with many textbook defi- 
nitions of the completion, is to turn M into a metric space and make use of 
this explicit metric when defining e.g. the topology of A4. (Both approaches 
yield the same end result.) There are several metrics which all reproduce 
the topology of M, but the following is often the most natural: 

I inf { 2 " I a — 6 G i3„ } otherwise 

for all a, 6 € A^. That d{a, a) = makes use of the infimum, whereas in the 
formula for d{a, b) when a ^b this inf is equivalent to a min. 

Definition 2.6. The completion of M is denoted A^, and M is considered 
to be a subset of M. For any N C M., the topological closure in of iV is 
denoted N. Let 3 = {B^j^^y 

The group operations extend by continuity to the whole oi A4, as do 
the homomorphisms in R, and will henceforth be considered to be defined 
on the whole of M. Accordingly, any subgroup A'^ C Al is said to be an 
i?- module for which r{a) e N whenever a e N and r e R. If Z C A1 is 
some set, then Span(Z) will denote the smallest i?-module which contains 
Z. Denote by Cspan(Z) the topological closure of Span(Z). 

Lemma 2.7. O is a fundamental system of neighbourhoods ofOinAi. In 
particular the elements of O are clopen (simultaneously closed and open), 
whence the topology of M. is zero- dimensional and totally disconnected. 

Proof. First consider an arbitrary F C M. that is closed and not disjoint 
from Bn for any n G Z+; as an auxiliary result it will be shown that such 

an F 3 0. The closure N of some N Q M consists of those points a G Al 
for which there exists some Cauchy sequence {ak}^i Q ^ such that a = 
lim/j^oo C'k- Let {an}'^=i C A4 be a sequence such that an E FHBn and let 
{bn,m}mn=i ^ A1 be a collection of points such that = lim^_»oo bn,m and 
{bn,m}m=i — ^'i for all n. Clearly G Bn for all n and thus bn,n ^ as 
n — > oo, which implies lim^_>Qo a„ = as well. Since F was closed, it foUows 
that G F. 

Now let U C M. be an arbitrary open neighbourhood of 0, and consider 

the matter of whether U contains some Bn for n large enough. The comple- 
ment F = Al \ J7 is a closed set that does not contain 0, and hence by the 
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converse of the above result there is some n for which FCiBn = 0, meaning 
that B^^U. 

Consider next the problem of showing that all the i?„ are clopen (both 
open and closed). A Cauchy sequence {ak}'kLi ^ has the property that it 
is either eventually in Bn or eventually in the complement ^A \Bn, because 
by the definition of Cauchy sequence there exists some m such that if i,j ^ m 
then Oj — Uj G Bn, or equivalently Ui G aj + Bn, and thus if aj G Bn for 
some j ^ m then aj E aj + Bn C Bn + Bn = Bn for alH ^ m, in which case 
{ak}kLi indeed is eventually in Bn- If no Uj G Bn for j ^ m then instead 

G \ Bn for all j ^ m, and consequently {a/cj^i will be eventually in 
M \ Bn- This property of Cauchy sequences means a sequence can converge 
either to an element of Bn or to an element of \ Bn, but not both, and 
therefore these sets will be disjoint; Bn U \ Bn is a partition of M. Since 
both parts in this partition are closed by definition, they are also both open, 
and in particular Bn is both closed and open. 

This has shown that O is a fundamental system of neighbourhoods of 
in M, and that its members are all clopen. A topology is said to be zero- 
dimensional if it has a basis consisting entirely of clopen sets, and every 
space with a zero-dimensional topology is totally disconnected. □ 

Zero-dimensional topologies are, just like Zariski topologies, perfectly 
fine topologies (i.e., all the axioms hold and hence the basic theorems follow), 
but a bit unsettling when one first encounters them as things do not behave 
in quite the way one has gotten used to in the standard topology on M — the 
multitude of sets that are open and closed at the same time being the most 
obvious oddity. Being metric, and consequently Hausdorff, the topology on 
A4 does however have much more in common with the standard topology 
on M than it has with Zariski topologies, so it is not all that far out. 

Exercise. An intuition for spaces like M may be found by comparing them 
to Cantor sets, as the two have many traits in common. Indeed, for M = 
Z2[a;] (univariate polynomials over Z2 = Z/2Z) and Bn = A^x^^^^ where 
a = log3 2, the completion M. is very similar to the standard Cantor set. 
Show that the map (f) : M. — > [0, 1] defined by 



for some positive constants Ci and C2, where |-| is the standard absolute 
value on M and d is the metric from (|2.3|) . Conclude that (p extends to a 
homeomorphism from M. to the remove-middle-third Cantor set on the unit 
interval. 





satisfies 



Cid{a, h) < U(a) - 0(6)1 ^ ^2^(0, b) for aU a,b £ M 
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Lemma 2.8. If A Q M and a € Cspan(yl) then for every e £ O there 

exists a natural number n, some elements {ai}^^^ C A, and some maps 
{n}'^=i ^ ±-R* such that 

n 

^ri{ai)ea + e. (2.4) 

i=l 

Proof. By definition of topological closure applied to Cspan(^), there exists 
some b € Span(^) such that a — b & e. Since the set of all elements on the 
form Y17=i ^i(oj) fo'^ {oj}f=i ^ A and {ri}f^-^ C ±R* constitute an i?- module 
containing A, it follows that b has an expression on that form. □ 

For A = y, this lemma is a topologized version of Lemma 12. 3( but 
Lemma 13. 7| Definition 15.31 Theorem 15.61 and Definition 15.71 all characterise 
important subsets of Ai as being on the form Cspan(74) for a suitable A C 
M. 

The rest of this section is a discussion of some important methods for 
constructing a topology on Ai, i.e., for choosing a system of neighbourhoods 
O. The trivial choice is to let i?„ = {0} for all n; this equips A4 with the 
discrete topology, makes A4 = A4, and simplifies the machinery below quite 
considerably. This is also the choice one should use if one wishes to reproduce 
Bergman's diamond lemma. 

A nontrivial choice of topology which has long traditions in algebra is 
that of an ideal-adic topology. In this case it is assumed that Ai also has a 
multiplicative structure, and as Bi is chosen a nontrivial ideal in A4. Then 
each Bn is defined as the nth ideal power Bf of Bi, i.e., the ideal generated 
by all products of n elements from Bi . Such choices of O allow localisations 
of A4 to be treated within this framework. The condition that all these 
Bn are i?-modules is not necessarily fulfilled for this construction, but it 
follows very naturally when for example M is an 7^-algebra and R is the set 
of multiplication-by-a-scalar maps. Nor is necessarily HJ^i — {0} 
every ideal Bi , but it typically holds for the interesting choices of Bi . 

A generalisation of the class of ideal-adic topologies is provided by the 
'filtered structures' of Robbiano [13j and Mora [TT]. Here it is again assumed 
that is a ring, and a F-filtration {F^j^gr of A4 is given. This means T 
is assumed to be a totally ordered semigroup (written additively, but at 
least in [11] not assumed to be commutative), and the are assumed to 
be subgroups of M. which satisfy: 

(Rl) If 7, 5 G r are such that 7 < 5 then C Fs. 

(R2) F^-Fs<^ F^+s for all 7, 5 G L. 

(R3) For every a € A1 \ {0} the set {7GF|Ky9a} has a minimal element. 
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For actual results, these authors typically also assume that T is 'inf-limited', 
which means that for any infinite strictly descending sequence 71 > 72 > 
73 > • • • in r and any given 7 G T, there exists an n such that 7n < 7- In this 
case, one can simply choose one such infinite strictly descending sequence 
71 > 72 > 73 > ■ • • in r and define O by letting B„ = F^^ for all n S Z+; 
it follows from (R3) and inf-limitedness that Hn^i = {0}. Also observe 
that the resulting topology on M is the same regardless of which sequence 
{7n}^i is chosen. 

The abstract setting of a filtered structure only supports setting R = 0, 
but again the are in many concrete cases modules over a ring of scalars, 
and then it is possible to encode the whole of that ring into R. Otherwise 
it is a rather striking feature of the filtered structure machinery that one 
does not assume any "coefficients" to exist from start, but rather constructs 
them from the filtered structure. Defining 

V-y := U Fs, 

<5er 

<5<7 

G := ^G^ 

7er 

one gets the associated graded ring G that can be used as a coordinatized 
form of hA. Each coordinate then assumes values in the corresponding 
G^\ these groups may vary quite a lot in size and structure, but for reason- 
able cases of M. being an 7^-algebra it often holds that each G-^ is as a group 
isomorphic to either TZ or {0}. It is also common that a filtered structure 
is equipped with a map /: F — > M. such that 7(7) S F^ for all 7 G F (and 
/(7) ^ whenever 7^ i^^), although that map is not part of the formal 
definition. The image of such an / is typically the primary candidate for y ^ 
even though there is nothing in the generic formalism from which one may 
deduce that this image should span M.. 

Even more general is the approach to define O as a family of balls with 
respect to an ultranorm |[-|| on M.: 

S„ = {a e 7W I ||a|| < 2~"} for all n e Z+. (2.5) 

In one sense this construction is universal, because if (D — {Bn}^—i is given 
then one can always use (12.30 to reconstruct a norm ||a|| = d(o, 0) that 
in (j2.5p would give rise to the original neighbourhood system O, but more 
important is that it often provides a convenient method for arriving at a 
topology with desirable properties. 

The standard construction of an ultranorm in the case M = ^{X) is to 
pick any function w : X — > M and define the ultranorm || • || on 3^ = X* to be 
the unique monoid homomorphism y — > that satisfies ||x|| = 2"'^^) for 
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all X £ X; in other words ||nr=i ■~ Y\7=i 2"'^^"-' for all X1X2 ■ ■ ■ Xn & 
This is then extended to the whole of A4 by defining 

||a|| := max \\fi\\ where Z G y is minimal such that a € Span(Z), (2.6) 

fiez 

and in particular letting ||0|| := 0, as a sort of = max0. The w is 
known as the weight function for the norm, and its sign determines how 
the formal variables behave; if w{x) ^ then x will be a polynomial-style 
variable, whereas if w^x) < then x will be a power-series-style variable. 
The logarithm of ||-|| behaves as a weighted polynomial-style degree function 
onM = TZ{X). 

Definition 2.9. Formally, a function a 1— s- ||a|| : M — > M is said to be a 
(group) ultranorm if 

(i) ||a|| ^ for ah a £ M. 

(ii) \\a — h\\ ^ max{||a|| , ||6||} for all a, 5 G M. 

(iii) ||a|| = for some a € A1 if and only if a = 0. 
If is a ring and in addition 

(iv) \\ab\\ ^ ||a|| ||6|| for ah a,b e M 

then ll'll is said to be a ring ultranorm. If instead 7^ is a ring with 
ultranorm |-| and A4 is an 7^-module, then a group ultranorm ||-|| on A4 is 
said to be a module ultranorm if 

(v) \\ra\\ ^ |r| ||a|| for all r G 7?. and a € A4. 

An algebra ultranorm has to satisfy all of (i)-(v). 

The 'ultra' prefix pertains primarily to property (ii) — the strong triangle 
inequality — and in particular to its right hand side max|||a|| , ||6||}, which 
is more strict than the ||a|| + ||^|| of the ordinary triangle inequality. Among 
the direct consequences of (ii) are that any e-neighbourhood of — i.e., any 
set of the form |aGAl| ||a|| <e} for a real number e > — is a subgroup 
ofM. 

The trivial ultranorm has ||0|| = and ||a|| = 1 for all a 7^ 0; it exists for 
all groups and reproduces the discrete topology. 

If A4 is an 7^-module and the ultranorm ||-|| on A4 satisfies ()2.6p then 
equipping TZ with the trivial ultranorm will make ||-|| an 7^- module ultra- 
norm. This is typically the "correct" scalar norm for a formal power series 
problem, as all nonzero scalar values are then equivalent for matters of series 
convergence. Conversely it is often convenient to define the norm on A4 so 
that it becomes an 7^-module norm with respect to some given norm |-| on 
TZ. In the particular cases where y is an 7^- module basis for Ai, then one 



12 



may choose to make \\rfj,\\ = \r\ ||//|| for all r € 7^ and fi £ y, which has its 
advantages when it comes to defining Ti{S) below. Non-trivial scalar norms 
may however require nontrivial choices also of R and y. 

What complicates the choice of a scalar norm is the assumption that 
each Bn is an i?-module, which given ()2.5p is equivalent to the claim that 
||r(a)|| < 2^" for all r G i? and a £ Ai satisfying ||a|| < 2^'". One would 
typically ensure this by enforcing the condition that ^ ||a|| for all 

r £ R and a £ ^A, and this will indeed be the case provided r : Ai — > M. is 
a map on the form o i— > sa for some scalar s £ TZ such that |s| ^ 1, as then 
||r(a)|| = \\sa\\ ^ |s| ||a|| ^ ||a||. Hence the natural choice of R when A4 is 
an 7^-module with a ditto ultranorm is to take 

R= {a^ ra : M — > M\r £ TZ and \r\ l] . (2.7) 

Unless all scalars r £ IZ satisfy \r\ ^ 1, this will make the i?- module con- 
cept distinct from that of an 7^- module however, and this has repercussions 
elsewhere, y must span A4 as an ii-module, so if Y is an 7^-module basis of 
M then y may have to be chosen as something like the set of all products 
rjJL for r £ TZ and fj, £Y in order to make it all fit. 

A case where this predicament arises is that of A4 being a module over 
the p-adic numbers Qp, as these come equipped with an ultranorm (the p- 
adic valuation) that has \p"'\ = p~" for all n G Z. The R defined by ()2.7p 
for TZ = Qp is isomorphic to the p-adic integers, but not to the entire field 
of p-adic numbers; conversely any Bn defined by p.5p will fail to be closed 
under multiplication by the scalar p~^ and is thus not a Qp-module although 
it will be an i?-module. A Qp-module basis Y for Ai will in this case not 
be large enough to serve as 3^. One can instead use the set of all terms as 
suggested above, but since p~^ together with the p-adic integers generate 
the whole of Qp, it is also sufficient to make y the set of all products p~"/i 
for n £Y and n £TL (or even n G N). It is typically easier to construct the 
partial order P on 3^ if the latter has a simple, discrete structure. 

3 Reductions 

While the purpose of introducing A^, i?, 3^, and O is primarily to fix and 
structure a stage for the diamond lemma, T\{S) is what provides the ac- 
tors on that stage, so that a play of equivalence and normal forms may be 
performed. The relation of Ti(S') to the equivalence of elements in ^A is 
primarily that these maps preserve it — t{a) must be equivalent to a for 
all t £ Ti{S) and a £ M. — but as the equivalence concept is derived from 
Ti{S), this is a theorem rather than an assumption. The role of the collec- 
tion Ti{S) of maps Ai — > Ai is mostly that of a presentation: it is not 
uniquely determined by that which it is used to define, but it gets the job 
done, and you may use it to verify at least some conditions about the whole. 
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In applications one rarely starts from Ti(S) — hence the somewhat odd 
notation; Ti(S) is typically constructed from a more fundamental set S of di- 
rected equivalences, a so-called rewriting system — but for this general proof 
it provides the best balance between abstract adaptability and concrete con- 
structibility. As in the previous section some examples will be given below 
of how Ti (S) can be constructed from such a more fundamental S, but this 
should be taken more as hints than as a full survey; it is sometimes neces- 
sary to combine several different methods of construction. That applications 
typically make some S the fundamental entity has however influenced the 
choice of notations below, in that every object that formally depends on the 
choice of Ti{S) is written as though it would depend on 'S". Besides being 
more convenient in applied instances of the diamond lemma, this choice of 
notations also simplifies comparisons with [7]. 

Definition 3.1. Let T(S) be the set of all finite compositions of elements 
from Ti{S); in particular the identity map id: ^A — > M. is considered to 
be an element of T{S), on account of being the composition of an empty 
sequence of maps in Ti(S). The elements of T{S) are called reductions 
and the elements of Ti (5) in particular are called simple reductions. 

The name reduction suggests that these maps take something away, and 
this is indeed typically the case. Standard constructions of reductions tend 
to make them more or less projections, and although there is no formal 
need for them to be, it may be helpful on a first reading to think of them 
that way. It should however be observed that even those reductions which 
really are projections tend to be rather skew and have very small kernels, so 
don't expect to use just one and be done with it; getting anywhere is much 
more like a round of golf, where one has to hit the ball repeatedly (typically 
making use of many different clubs) in order to get it into the hole. 

The distinction between simple and non-simple reductions is mostly in 
the eye of the beholder, because nothing prevents picking as Ti{S) a set 
of maps that constitute a monoid under composition, in which case one 
would have T{S) = Ti{S) and all reductions would be simple. The point of 
letting the user designate some reductions as being simple is that it is often 
sufficient to verify a condition only for the simple ones, as the property in 
question easily extends to all reductions. 

Assumption 4. Every simple reduction t € ^1(5) is a continuous group 
homomorphism M — > M which satisfies t o r = r ot for all r G R. 

Reductions may of course satisfy this property for a larger class of maps 
than R; they may for example all be 7?.-linear for some ring TZ that is larger 
than R. Therefore many lemmas below that say some set is an i?-module 
will have a 'more generally, . . . ' clause which covers the case of additional 
maps r that commute with all reductions. 
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Lemma 3.2. Every reduction t £ T{S) is a continuous group homomor- 
phism A4 — > M which satisfies t o r = r o t for all r £ R* . 

Proof. The identity map id € T{S) trivially satisfies the properties in the 
lemma. All other reductions are finite compositions of simple reductions, 
and since the composition of two continuous group homomorphisms is a 
continuous group homomorphism, it follows from Assumption S] that all 
reductions are continuous group homomorphisms. Finally if t = t„o- • -oti for 
1 1 , . . . , t„ G Ti (S) and r = o • • • o n for ri , . . . , € -R, then t j o rj = rj o ti 
for all i and j by Assumption [H whence t o r = tn o ■ ■ ■ o ti o r^ o • ■ ■ ° ri = 
rm ° ■ ■ ■ o ri o tn o ■ ■ ■ o ti = r o t. □ 

Lemma 3.3. A reduction is uniquely determined by its values on y. 

Proof. Let t G T{S) be arbitrary. Since t is continuous and M is dense in 
Ai, the values on M. uniquely determine t. By Lemma \2.'d\ any a £ A4 
can be expressed as a = X^fc=i '"fcC/^fc) for some n G N, ri, . . . ,rn € ±R*, 
and iJi,...,Hn^y- Hence t{a) = J2k=i^{'^k{fJ^k)) = Ylk=i'^k{t{nk)) , which 
expresses t{a) purely in terms of the values on 3^ of t. □ 

The definitions of reductions are accordingly often simplified to stating 
how they act on elements of y. A common approach is to define simple 
reductions so that they change precisely one element of y while leaving all 
other elements the same. Concretely the simple reduction which changes 
H £ y to a £ ^A would be defined by 

t^^aiX) = h '^^ = ^'' for all AG 3^. (3.1) 

I A otherwise, 

Such a map satisfies t^^a{b) = b for all b G Cspan(3'\ {/i}) , so if in addition 
a G Cspan(3'\{/u}) (which will be hard to avoid while satisfying the compat- 
ibility condition of Definition 15. 3p , then the image of tf^^a will be contained 
in Cspan(3^ \ {fi}) and consequently this map becomes a projection. Its 
kernel is however as small as it can possibly be without being trivial, and 
the a is only rarely zero, which means the projection is typically skew. 

While (13. ip is the standard definition of a simple reduction from a con- 
ceptual point of view, it is not obviously one which is formally sound; if for 
example y is not an independent set in M then a map t^^a for arbitrary 
IJ, € y and a £ A4 can probably not both be a group homomorphism and 
satisfy ()3.ip . An alternative definition of t^^a , which often is better suited 
for proving properties of this reduction, is 

tf^^aib) = b - f^{b) -{fi-a) for all b G M. (3.2) 

Prerequisites for this formula is that M. is some sort of 7^-module that fur- 
thermore comes with coefficient-of-/U homomorphisms : M. — > TZ for all 



15 



// G 3^; if these satisfy //^(/i) = 1 and /^(i^) = for all e 3^ \ {fj.} then 
p.ip becomes an immediate consequence of ()3.2p . Assuming the 7^-module 
operations on are continuous, the continuity of t^^a is furthermore im- 
plied by the continuity of the coefficient function and this depends only 
on the choices of Ai, R, y, and O. With ultranorms defined using (12. 6|) . 
the continuity of these maps is typically something one gets for free [3 
Ssec. 2.3.2]. See also Lemma [7?T1 

Formulae like ()3.2p can often be used to define the simple reductions even 
in cases where (|3.ip leads to contradictions due to dependencies between 
elements of y. One example of this is the situation that is a free TZ- 
module with basis Y, but R is less than TZ and y therefore has been chosen 
as the set of all multiples rfj, for r € 7^ and n If the range of n in ()3.2p 
is restricted to Y then this formula still makes perfect sense, but the result 
is of course rather a map t^j^a satisfying 

\ ra if A = r/i for some r & TZ, „ „ , 
t^^a(A) = <^ ^ ^ ^ for all A G 3^. 

I A otherwise, 

The underlying idea for all these definitions of a reduction t^i-^a is to distin- 
guish the part of a general element of A4 that corresponds to the particular 
undesired element /x of 3^, and then replace this part by something it is 
equivalent to. This process is often straightforward for concrete problems, 
even though it may seem difficult to formalise in general. 

Definition 3.4. A reduction t G T(S) is said to act trivially on some 
o G if t{a) = a. An element a G is said to be irreducible (with 
respect to T{S)) if all t G T{S) act trivially on it. The set of all irreducible 
elements in Ai is denoted Irr(S'). Also let 



I{S) = ^ { a - t{a) \a£M} (3.3) 
teT{s) 

and write a = b (mod 5) for a — 6 G I{S). An a G Irr(S') is said to be a 
normal form of 6 G if a = 6 (mod S) . 

The main theme in the next two sections is to define a projection of 
A4 onto Irr(5) that constitutes a kind of pointwise limit of T{S), and then 
demonstrate that 2{S) is the kernel of that projection; from this will follow 
that there exists a unique normal form (which is computed by the map t^) 
for every element of A4. When it all works out, there is an equivalence 

a = b (mod S) <^ t^{a) = t^{b) for all a,b£M, (3.4) 

where the right hand side is algorithmic in style and well suited for calcu- 
lations, whereas the congruence relation = in the left hand side is identi- 
fiable as the reflexive-symmetric-transitive-algebraic-topological closure of 
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'a = t{a) for all t € Ti{S) and a G M\ A problem that is difficult on one 
side of the equivalence may have an obvious solution when transported to 
the other side of it; the main direction for decision problems is left to right, 
whereas identities tend to be simpler to derive on the left side. Applied 
calculations often focus on the irreducible elements, because the set Irr(5) 
can be used as a model for the quotient set M.j= (mods'). 

It should be pointed out that this concept of irreducibility has nothing 
to do with multiplicative irreducibility (the property that the only factorisa- 
tions of an element are the trivial ones), nor for that matter with for example 
join-irreducibility (which in lattice theory is the equally important property 
that an element cannot be expressed as the V of two other elements), but 
the point about multiplicative irreducibility needs to be stressed since many 
algebraists are accustomed to interpreting an unqualified 'irreducible' as re- 
ferring to precisely multiplicative irreducibility; indeed this tendency is so 
strong that many authors seek other names to use for this concept. One such 
synonym is normal (which in this sense most commonly occurs in the phrase 
'normal form'), that unfortunately also has the alternative interpretations 
"having norm 1" and "being orthogonal to tangents", which are quite dif- 
ferent. Another synonym is terminal, which refers to the fact that reduction 
stops when reaching one of these elements — however in this topologized set- 
ting reductions do not in general stop completely; they merely "slow down" 
when approaching the limit. 'Irreducible' is the term that is used in |2j and 
must therefore be considered established and standard. 

Lemma 3.5. An element of A4 is irreducible if and only if every simple 
reduction acts trivially on it. The set Irr(5') is a topologically closed R- 
module. More generally, any continuous group homomorphism r : A4 — > M 
satisfying r o t = t o r for all t S T(S) maps Irr(5) into itself. 

Proof. Clearly every simple reduction acts trivially on an irreducible ele- 
ment. Conversely every non-simple reduction is a composition of simple 
reductions, and if all of these act trivially on an element, then the compos- 
ite reduction must do so too. Hence all elements which simple reductions 
act trivially upon are irreducible. 

Let t € T{S) be arbitrary. Then the set It of all 6 € A1 such that t{b) = b 
can alternatively be characterised as the kernel of the map t' : Ai — > Ai 
defined by t'{b) = t{b) — b. Since i is a continuous group homomorphism, 
t' will be one too, and thus the set It will be a subgroup of Ai. Moreover 
It is topologically closed since it is the inverse image of {0}, which is a 
closed set. If r is a homomorphism commuting with t then for any b It, 
= r(t'{b)) = (rot) (6) —r{b) = {tor){b) — r{b) = t'(r(6)) , and hence r maps 
It into itself. Finally Irr(5') = \^teT{S) hence Irr(S') must also be a 

topologically closed i?-module, since these properties are preserved under 
arbitrary intersections. □ 
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There is a similar set of basic properties that hold for the complementary 
set T{S) of elements equivalent to 0, but before going into that it is conve- 
nient to expound a bit on some additional twists in the usual construction 
of simple reductions. It was said above that S could be a set of "directed 
equivalences", usually known as rewrite rules or simply rules. Concretely 
these rules may be expressed as pairs (/x, a) € 3^ x where // (the 'princi- 
pal' or 'leading' part) is to be replaced by a. The homomorphism t^^a as 
constructed in p.ip or (13. 2p implements this replacement, but Ti{S) typi- 
cally contains more than just those t^^a maps for which (/x, a) G S\ there 
will also be reductions which arise from placing the basic rule into various 
contexts. This produces reductions that apply in cases where ^ occurs as a 
part of a larger expression. 

In the classical case of Bergman's diamond lemma, where Ai = T^{X) 
and 3^ = X* , this means that a pair (/i, a) € S should not only give rise 
to a simple reduction which maps ^ to a, it should also for every multiple 
z/i/xf2 of /i give rise to a simple reduction designed to map that to the 

corresponding multiple i/iaz^2 of a. Thus if one defines 



for all {iJ,s,as) = s £ S and A,z^i,i^2 £ y, then the corresponding construc- 
tion of Ti (S) is 



In this case, it follows that a monomial A G X* is irreducible if and only if 
it is not a multiple of fig for any s G S*. If S" is finite then the irreducible 
words furthermore constitute a regular language (i.e., they can be described 
by a regexp), although infinite rewriting systems are unavoidable for some 
equivalences. Conversely, it would typically not be possible to make do with 
a finite system S unless there was this kind of "put rule into all possible 
contexts" mechanism manufacturing an infinite family of reductions from 
every concrete rule. Though this twist is not a technical necessity, it is in 
practice very helpful. 

The basic idea of putting rules into all possible contexts remains useful 
in general, but beyond associative algebra it quickly becomes difficult to ex- 
press concretely in elementary notation. The abstract form of this construc- 
tion is that one has a set V of continuous homomorphisms A4 — > A4 , which 
furthermore map y into itself, and defines simple reductions t^^g through 




z^iasi^2 if A = vi^isV2 
A otherwise. 



(3.5) 



Ti{S) = {t. 



UlSU2 



i'i,i'2 ^ y,s £ s} . 



(3.6) 



t 



■v,s 




(3.7) 
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for all (fisjCLs) = s £ S and v £ V. The set of maps which gives rise to the 
classical case described above is 

V = {bh^ i^ibu2}uuu2<^X', (3.8) 

but one can also consider other families — see below for some examples. 

If V (as above) is a monoid under composition then the construction 
(j3.7p has the effect that there for every such reduction t^^g, every w & V, and 
every X £ y exists another reduction t^aiov,s which satisfies twov,s{w{X)) = 
w{tv,sW) , and if w is injective then it's even t^ov,s°u! = wot^ g. This turns 
out to be a very useful property, so it deserves a name. 

Definition 3.6. A map v. A4 — > Ai is said to be advanceable (with 
respect to Ti{S)) if there for every t G Ti{S) and b € R*y exists some 
u £ T{S) such that u(f(6)) = v(t{b)). The map v is said to be absolutely 
advanceable (with respect to T{S)) if there for every t G T{S) exists some 
u G T{S) such that v o t = u o v. For contrast, the ordinary advanceability 
may also be called conditional advanceability. 

Advanceable maps provide a way to reason about and take advantage 
of the kind of structures in the set of reductions that arise from using (j3.7p 
or some variation thereof. The name comes from the point of view that 
if an advanceable map v and a reduction are both to be applied to some 
element, then one can always arrange things so that v is applied before the 
reduction (it can be advanced past a reduction), even if that may come at 
the price of having to apply a different reduction. A point worth noticing is 
that absolute advanceability only needs to be checked for simple reductions, 
as a map v can be advanced past ti o t2 if it can be advanced past ti and t2, 
whereas a conditional advancement need not have this composition property; 
different terms of t2(&) may call for different translations of ti when one tries 
to advance v past ti. It is however the conditional variant that in practice 
is most important. 

Absolute advanceability can be viewed as a weaker form of the 'more gen- 
erally' condition in Lemma \3.5\ in that it doesn't require the two reductions 
to be equal; this point of view is employed in the next lemma. Another way 
of expressing this condition is that v o T{S) C T{S) o v, and therefore some 
may prefer to describe an absolute advanceable map as being an element of 
the left-normaliser of T{S), but that characterisation seems difficult to use 
for conditional advanceability. Furthermore the characterisation as element 
of the left-normaliser breaks down in the more general case of a multi-sorted 
structure; Definition 16.11 gives the whole story. 

Lemma 3.7. The set T{S) is a topologically closed R-module. More gener- 
ally, any advanceable continuous group endomorphism on M. maps T{S) into 
itself. Any reduction maps T{S) into itself, and in particular kert C I{S) 
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for all t € T{S). Furthermore 



X{S) = ^ {a- t{a) \aeM]= ^ {a- t{a) | a G 7W } = 

iGTi(S) teTi(5) 

= Cspan(^{ - t{^l) I /i e 3;, t G ri(5) }) . (3.9) 

Proof. Let t G T{S) be arbitrary. For any b,c A4 one finds that 

{b-t{b))-{c-t{c)) = h-c-t{h)+t{c) = {b-c)-t{b-c) € {a- t{a) | a G M} 

and hence Nt := {a — t{a) | a G } is a subgroup of M. Obviously the 
sum of a family of groups is a group, and the closure of a group is a group 
because the group operation (addition) is continuous. Hence I{S) is a group, 
and it is topologically closed by definition. From the similar observation that 
any a G ker i satisfies a = a — t(a) G A^t C I{S), it follows that keri C I{S). 

Now let r: A4 — > A4 be an absolutely advanceable continuous group 
homomorphism. Let t G T{S) be arbitrary and choose some t' G T{S) such 
that r ot = t' or. For any 6 G one finds that 

r[h - t{b)) = r{b) - r{t{b)) = r{b) - t'{r{b)) G Nf 

and hence r maps Nt into Nti. It follows from the fact that r is a homo- 
morphism that r maps N = Y2t€T{s) ^^^^ itself, and then from the fact 
that r is continuous that it maps I{S) = N into itself. Since in particular 
all r G i? are absolutely advanceable, it follows that I{S) is an i?-module. 

Again let t G T{S) be arbitrary and consider the matter of whether t 
maps 1(3) into itself. For any t' G T{S) and b G Nf/ it holds that b = a—t'{a) 
for some a G A^, and thus 

t{b) = t{a) - {to t'){a) = a - {tot'){a) - a + t{a) G Ntot' + Nt C X{S). 

Hence t{Nt') C I{S) for all t' C T(S) and this extends as above to arbitrary 
elements of T(S'); an arbitrary reduction t G T{S) maps I{S) into itself. 

For the claim that I{S) can be constructed from the Nt groups of simple 
reductions, one may first observe that A^'id is just {0}, and thus does not 
contribute anything unique to I{S). Any other nonsimple reduction t G 
T{S) is a finite composition t„ o • • • o = t of simple reductions ti, . . . , G 
T{S) and it holds that Nt C YL^=i j because ii Uk = tk o ■ ■ ■ o ti for k = 
1, . . . , n then any a—t{a) G Nt can be written as a—ti{a)+ui{a)—t2 (ui(a)) + 

h tin-i(a) - tn{un-iia)) € Nt^ + Nt^ -\ h Nt„. Hence Y.teT{S) ^ 

YlteTi{s) terms for simple reductions suffice for producing the total 

sum. 

Define Mt := {a- t{a) | a G Al } for all i G Ti{S). The last equality 
in (13. 9p follows from the observation that Mt = Span( { — t{^) | G 3^ } ) 
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for all t G Ti(5). In the middle equality, the ^ inclusion trivially follows 
from Nf 2 Mt- For the reverse inclusion, let b in the closure of Y2t<=:Ti{s) 
be given. Let e € O be arbitrary. Since 6 + e is a neighbourhood of b it 
contains some element of J2teTi{S) there exists a finite U C Ti{S) 

and {at}teu ^ such that 6 — '^tsui^t ~ ^i^t)) £ £■ Let 5 G O be such 
that S C e and t((5) C e for all t e U. Let {Qjtgf/ C M he such that 
at — Ct & d for all t £ U. Then 

^(ci-t(ci)) = '^{at-t{at))+'^{ct-at)-'^t{ct-at) G 6+e+(^-e = 6+e 

t€U t€U t€U teU 

and hence b, by the arbitrariness of e, is in the closure of ^^teTiiS) 

The claim that also a conditionally advanceable continuous homomor- 
phism V will map I{S) into itself is now an easy consequence of (13. 9p : by 
continuity and since w is a homomorphism, it suffices to show v{b) G I{S) 
for arbitrary b G Span({/i — fi G y, and t G ri(5). Let such 6, fi, 

and t be given. There exist {ri}f^i C iti?* such that b = Y17=i ^«(/^ ~ 
and G T{S) such that (ti o v o ri){fi) = (^v o t o ri){fj,). Thus 

^r,(^-t(^))) = J^(t;(r,(/i)) -t;(ri(t(/i))) 

1=1 ^ ^ 

□ 

In the case of Bergman's diamond lemma, where all maps on the form 
(j3.8p are advanceable, this lemma implies that I{S) is a two-sided ideal: 
it is closed under addition, multiplication by a scalar, and multiplication 
on either side by an arbitrary generator of the algebra Ai = TZ{X), so by 
distributivity it is closed under multiplication by arbitrary elements. The 
letter 'T' was chosen in anticipation of this, since T is the initial of 'ideal', 
but it is by no means restricted to two-sided ideals. 

Definition 3.8. Let y be a set of maps A4 — > Ai. A nonempty i?-module 
N C A4 is said to be a l^-ideal if it is topologically closed and v{a) £ N for 
all V £ V and a £ N. A set A C is said to be a V-ideal basis for N if 

= CsTpan{{v{a)}aeA,vev) ■ 

From a minimalistic formal perspective the 'T^-ideal' concept is unneces- 
sary, as it is equivalent to 'topologically closed (i?U l/)-module', but (apart 
from being shorter) the 'y-ideal' terminology has the advantage of being 
closer to the familiar terms 'left ideal', 'right ideal', and 'two-sided ideal' 
of which 'F-ideal' is a common generalisation. Furthermore, it is 'y-ideal 
basis' that is the more important concept in the above definition, since that 
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is a first step on the way to defining a Grobner basis. Ideal bases have the 
right associations for this, whereas any combination of 'module' and 'basis' 
is likely to give rise to incorrect expectations about independence between 
basis elements. 

Besides preparing for subsequent developments, this definition also gives 
an opportunity to summarise some of the above constructions of simple 
reductions into a formal statement that actually claims something, even if 
it isn't very much. 

Corollary 3.9. Let TZD R be a unital ring of continuous endomorphisms on 
that is equipped with a topology such that the TZ-module action IZxM — > 

Ai : {r,b) i— > r(6) =: r ■ b is continuous and TZ is complete. Assume M is 

a free TZ-module with basis y such that each coefficient- of- fi homomorphism 

ffj_ : M. — > TZ is continuous. 

If V is a monoid of continuous TZ-module homomorphisms A4 — > M 

that map y into itself then the following holds for all S ^ y x A4: 

1. For any v and s = {fis,as) € S, the map t^^s- -M. — > Ai defined 
by 

t^,s(.b)=b- f^^^^j{b){v{fis)-v{as)) forallbeM (3.10) 
is a continuous homomorphism that commutes with all elements ofTZ. 

2. If Ti{S) = {tv,s}v<=:V,s&s then every v is advanceable and the set 
{Hs — cis}ses is a V -ideal basis forZ[S). 

3. Any injective element of V is absolutely advanceable. 

Proof. The definition (I3.10p of t^^g is clearly a composition of maps that 
by assumption are continuous homomorphisms. Furthermore i„,s(r • b) = 
r-b- f^(^^){r ■ b) ■ v{fis_^ Os) = r ■ b - {r o /^(^^)(6)) • v{ns - Os) = r ■ ty^s{b) 
for all r G 7^ and 6 G since fv{fis) 7^- module homomorphism and • 

is a left module action. Hence t^^s o r = r o t^^g. 

By the last part of (13. 9|) . T{S) is spanned by all A — t{\) for A € 3^ 
and t G Ti(5), i.e., all A - tt,,s(A) = A - A + /t,(^,)(A) • v{^is - cls). This 
is unless A = v{fis), in which case it is equal to v{fis — o-s)- Hence 
Cspan({7;(/Us - as)}v<^v,ses) = ^{3). 

In order to show that w & V is advanceable, let t^^s £ Ti{S) and X £ y 
be arbitrary. If A = vins) then 

w{t^^s{X)) = w(^X - /t,(^,)(A) • v{fj.s - fls)) = w{v{fis) - v{iis - as)) = 
= {wo v){as) = {wo v){fis) - {wo v){ps - Os) = 

= W{X) - f(n,ov)(p,){{w ov){Hs)) ■ {wov){fls - As) = twov,s{w{X)) 
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and since w, t^^g, and twov,s are all 7^- module homomorphisms it follows that 
w(tv,s{i" • A)) = twov,s{'w{r ■ A)) for all r ^ TZ. If instead A 7^ v{fj,s) then 

w{t^,siX)) = w(^X - fv{^,s)W ■ ^(Ms - as)^ = w{X) = id('u;(A)) 

and since w, t^^g, and id are all 7^- module homomorphisms it follows that 
w(tv,s{f • A)) = id(u;(r • A)) for all r ^ TZ. Either way, w can be advanced 
past ty^s and hence w is advanceable. 

If w is injective then w{X) = {w o v){^s) if and only if A = and 
hence f(wov)(tis) ° w = fv(f,^). In this case w{ty^s{b)) = tj^ov,s{'wib)) for all 
beM. □ 

In the case A4 = 71{X), the choices of V that give rise to one-sided ideals 

are 

V = {b^ ub}y(:x* (left ideal), 

V = {b ^ bv}yi^x* (right ideal); 

multiplication by a non-monomial element of M does not map y = X* 
into itself and can therefore not be used with Corollary 13.91 In the fourth 
classical case that M is an 7^[X]-module and one wants T{S) to be an 
submodule, the right choice is to make V the set of all maps b 1— > ^'H^ex ^""^ 
for {nx}x&x C N, i.e., the set of maps that multiply by power products on 
X. 



4 The limit of all reductions 

The head-on approach for defining the sought projection of M. onto Irr(S) 
would be to immediately seek a map Ad — > Irr(5), but often a subtler 
approach is more convenient. The route taken here is to (i) define subsets 
of M. where the collective of reductions has nice properties, (ii) use those 
properties in the definition of , and only afterwards (iii) show that the 
subsets defined in (i) are in fact the whole of M; the same approach was 
followed in [2]. Steps (i) and (ii) are carried out in this section, whereas 
step (iii) is the subject of the next. 

The definition of as a pointwise limit of T{S) would be to say that 
t^{a) is the element of Irr(5) that is a limit point of { t(o) 1 1 G T{S) }; m a 
discrete topology, this simply says that t^{a) is the element of Irr(S') which 
is equal to t{a) for some t G T{S), but in general one will have to make do 
with being able to get arbitrarily close to some irreducible element. For this 
idea to work as a definition of t^{a) it is of course first necessary that such a 
limit point exists (and second necessary that it is unique), but the existence 
condition becomes much more convenient if one strengthens it a bit. 
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Definition 4.1. An a ^ M is said to be stuck in N <^ M under T{S) 
if t{a) £ N for all t € T{S). Given an e € O, an a G is said to be 
persistently e-reducible under T(S) if there for every ti G T{S) exists 
some t2 € T{S) and b G Irr(5) such that ^2(^1(0)) is stuck in 6 + e. 

If a is persistently e-reducible for all e G O then a is said to be per- 
sistently reducible. The set of all elements in M that are persistently 
reducible under T{S) is denoted Per(S') and the set of all elements in A4 
that are persistently e-reducible under T{S) is denoted Per£(5). 

The extra power added in this definition is that the wanted outcome of 
being close to Irr(5) should persist no matter what has to be done before 
or after the t2 reduction chosen to get there: o G Pere(5) if and only if 
there for every ti G T{S) exists some t2 G T{S) and b G Irr(S') such that it 
holds for every ts G T{S) that (ts o t2 o ti)(a) — 6 G e. The corresponding 
property in [2] is 'reduction-finiteness'; see ^ Ssec. 3.1.2] for a comparison 
and analysis of the two. 

Lemma 4.2. For each a G Per(5'), there exists some b G Irr(S) such that 
for every e €z O there is some t G T(S) such that t(a) is stuck in b + e. 
Furthermore a and b are such that a — b£ ^(S), and hence Per(S') C Z{S) + 
1tv{S). 

Proof. Construct a sequence {uij^o — '^i^) letting uq = id and recur- 
sively defining Un for n > as follows: by persistent reducibility of a there 
exist tn G T{S) and 6„ G Irr(5') such that tn(un-i{a)) is stuck in 6„ + Bn, 
therefore let Un = tnO Un-i- 

For any n, i,j G N such that i > j ^ n, the element Ui{a) is in 6j + Bi as 
well as in bj + Bj , which means 

bi - bj = [bi - Ui{a)) + {ui{a) - bj) € Bi + Bj C 

Hence {6„}^o is a Cauchy sequence in Irr(S') and consequently it converges 
to some b G Irr(S'). Fix some n G N such that Bn ^ e and consider 
limj^oo(^i — bn). Since all elements in this sequence are in the closed set Bn 
it follows that the limit b — bn ^ Bn ^ £. Thus 6„ + e = 6 + e and Un{a) is 
stuck in 6 + e as claimed. 

To see the second claim, observe that Un{a) b as n 00. For 
any n G N, a — Un{a) G I{S) by definition and consequently a — b = 
lim„^oo(a — Unici)) S ^{S) as well. Hence a G 1{S) + Irr(S'). □ 

This proof highlights a subtle point in the basic set-up of Section [2] which 
may be regarded as a restriction, namely that the family O = {Bn}^^i 
must be countable: the construction of {un}'^=Q would not necessarily suffice 
for demonstrating convergence if O was uncountable. In other proofs it is 
possible to treat O as an arbitrary collection of neighbourhoods, which might 
suggest a generalisation to topologies defined by an uncountable O is not 
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unreasonable, but on the other hand the countabiUty of O is in this lemma 
closely tied to the status of T(S) as a set of finite compositions of elements 
of Ti{S), and that is something several proofs rely on. Hence removing 
the condition that O is countable would probably require a more powerful 
construction of reductions than as mere compositions of simple reductions; 
it is presently not something for which I see any precedence. 

Besides this existence of a limit point property, it is also important that 
the set of persistently reducible elements is closed under algebraic opera- 
tions. This exercises the slightly different aspect of persistent reducibility 
that one can find reductions which simultaneously take several persistently 
reducible elements close to their normal forms. 

Lemma 4.3. The set Per(5) and for every e € O the set Per£(S') are R- 

modules. More generally, Per(S') is mapped into itself by every continuous 
group homomorphism A4 — > M which commutes with all reductions. 

Proof. Let e G O be given. Clearly Irr(5) C Per£(S'), and hence to see 
that the latter is a group, it suffices to show for two arbitrary elements in 
it that their difference is also in this set. Therefore let 01,02 G FeTs{S) and 
ti G T{S) be arbitrary. There exists some t2 G T{S) and bi G Irr(S') such 
that ^2(^1(01)) is stuck in bi + e. There also exists some t^ G T{S) and 
62 G ^(6") such that i3((i2 ° ti){a2)) is stuck in 62 + e. Now let t^ G T{S) 
be arbitrary. Since 

t4{{t3 0t2){ti{ai-a2))) -(61-62) = 

= (^(t4 0t3)(i2(il(ai))) - (^t4(t3((i2 0ii)(a2))) -62^ G £ + £ = £, 

it follows that t4^(t3 o t2)(ti{ai — 02))^ S (61 — 62) + £• Hence the element 

(^3 ° t2){ti{ai — 02)) is stuck in {bi — 62) + £, and by arbitrariness of ti it 
follows that oi — 02 G Pcr£(S'). 

Now let r : A4 — > M be an arbitrary continuous group homomorphism 
which satisfies r o t = t o r for all t G T{S), and let 5 G O be such that 
r{S) C £. Let a G Per5(S') and ti G T{S) be arbitrary. Let t2 G T{S) and 
b G Irr(S') be such that (t2 oii)(a) is stuck in 6-|-£. Then for any G T{S), 

{hot2oti){r{a)) -r{b) =r{{t3ot2oti){a)) - r{b) = 

= r((t3 0*2° ti){a) - 6) G r{5) C e 

and thus {t2ot\){r{a)) is stuck in r{h) + e. It follows that r(a) is persistently 
£-reducible. 

In the case that r e R, one knows from the fact that e is an i?-module 

that one can take 5 = e, and then the above has shown that Per£(S') is an 
i?-module. For more general r this need not be the case, and then one has 
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only shown about r that there for every e £ O is some 6 & O such that r 
maps Per5(S') into Per£(5). Suppose now that a € Per(5) is arbitrary, and 
consider the question of whether r(a) € Per (5"). Let e € O be arbitrary and 
letSed be such that r{FeTs{S)) C Pere{S). Since a G Per(S) C FeTs{S), 
it fohows that r(a) G Per£(5), and hence r(a) G neg(5P6re(5) = Per(5) by 
the arbitrariness of e. □ 

Definition 4.4. Let e G O be arbitrary. An a G is said to be e- 
uniquely reducible under T{S) if, for any ti, t2 G T{S) and 6i, 62 G Irr(S') 
such that ti{a) is stuck in 61 + e and t2{a) is stuck in &2 + it holds that 
bi + £ = b2 + £■ The set of all elements in which are both persistently 
and e-uniquely reducible under T{S) is denoted Rede(5). 

Lemma 4.5. For every e G O, the set Rede(/S') is an R-module that fur- 
thermore is mapped into itself by every reduction. 

Proof. Let e G O be given. Let 01,02 G Rede(S') be arbitrary. It follows 
from Lemma 14.31 that 01—02 G Per(S') and hence there exists ti G T{S) 
and b G Irr(S') such that ti(oi — 02) is stuck in 6 + e. Since oi, 02 G Per(S') 
there furthermore exist t2,ts G T{S) and 61,62 G Irr(S') such that t2(ii(ai)) 
is stuck in 61 + e and t^[(t2 o ti)(o2)) is stuck in 62 + This implies that, 
for t = ts o t2 o ti, 

6 - (61 - 62) = 6 - t{ai - 02) - (61 - t(oi)) + (62 - t{a2)) Ge-e + e = e. 

Starting from some other t'l G T{S) and b' G Irr(S') such that t'i(ai — 02) 
is stuck in b' + e, one similarly gets the existence of t2,t'^ G T{S) and 
61,62 G Irr(S') such that t2(i'i(ai)) is stuck in b[ + s and t'^[{t2 o t'i){a2)) 
is stuck in 62 + e; in precisely the same way one furthermore shows that 
6' — (6'^ — 62) G £. The e-unique reducibility of oi and 02 does however imply 
that 61 — b'l G e and 62 — 62 G £. From this follows that 6 — 6' G e and hence 
oi — 02 G Rede(S'). This has shown that Rede(S') is a group. 

It must also be shown that elements of R map Rede(S') into itself. Let 
r G R, e £ O, and a G Red£(S') be given. Let ti, t2 G T{S) and 61, 62 G Irr(5) 
be arbitrary such that ii (r(o)) is stuck in bi+e for i = 1,2. Since o G Per(S'), 
there exist t[,t2 G T{S) and 6'^, 63 G Irr(S') such that t[{ti{a)) is stuck in 
6^ + e for z = 1, 2. It follows that 

bi - r{b[) = [bi - {t[ o ti o r)(o)) + r((t^ o ti){a) - b[) G e + r(e) = e 

for i = 1,2. Hence 

61 - 62 = 61 - r(6'i) + r(6; - 6'2) + r(62) - 62 G e + r(e) +e = e 

and thus r(a) is e-uniquely reducible by the arbitrariness of ti and ^2- By 
Lemma 1131 r{a) is also persistently reducible, and so r(a) G Rede(5). 
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The corresponding property for reductions is more trivial, e-unique re- 
ducibility of t{a) for a £ Rede(5) and t G T{S) is the claim that any 
ti,t2 G T{S) and 61,62 G Irr(S') such that ti[t{a)) is stuck in 61 + e and 
t2{t{a)) is stuck in 62 + e satisfy 61 + e = 62 + £) but that is just a special 
case of the e-unique reducibility of a. □ 

Definition 4.6. An element in A4 is said to be uniquely reducible if it is 

e-uniquely reducible for all e £ C The set of those element which are both 
persistently and uniquely reducible under T{S) is denoted Red(S'). Define 
the map : Red(5') — > lvi{S) by letting t^(a) be the unique element of 
Irr(5) with the property that there for every e £ O exists some t S T{S) 
such that t{a) is stuck in t^{a) + e. 

The chain of sets defined in this section is thus that Red(S') C Red£(S') C 
Per(5) C PeVsiS) C M, with Red(5) = nega^^^e('^) PeT{S) = 
Pl^j^g Pere(S'). The end one wants to see is that all of these are equal, 
and in Lemmas 15.51 and 15.81 this is taken care of by giving sufficient condi- 
tions for Per£(5) = Ai and Rede(S') = M. respectively. A more immediate 
goal is however to establish that circumstances inside Red(5) are good. 

Lemma 4.7. The set Red(S') is an R-module that is mapped into itself by 
every reduction. The map t^ : Red(S') — > Irr(5) is well-defined and a group 
homomorphism. More generally, every continuous group homomorphism 
r: M. — > M. which commutes with all reductions maps Red(5) into itself 
and commutes with t^ . In addition, t^{b) = b for all b G Irr(S'), kert'^ C 
I{S), and t^{t{a)) = t^{a) for all a G Red{S) and t G T{S). 

Proof. Since Red(S') = P|^ggRed£(5') is an intersection of sets which by 
Lemma [4. 5 1 are i?-modules that are mapped into themselves by every reduc- 
tion, it follows that Red(S') shares these properties. 

Next consider t^ . It was shown in Lemma 14.31 that there for every a G 
Per(S') exists some b G Irr(S') which is a candidate for being t^{a), but what 
about uniqueness? One may observe that if a G Red(S') and 61,62 G Irr(5) 
are such that there for every e £ O exist ti,i2 £ T{S) such that ti{a) is 
stuck in 61 + e and t2(a) is stuck in 62 + e, then by e-unique reducibility 
61 — 62 G e for every e G O. Hence 61 = 62 as CI^^qS = {0} and thus t^ 
is well-defined. The same argument for (ti ot){a) and ^2(0) being stuck in 
61 + e and 62 + e respectively demonstrates that {t^ o t){a) = t^{a) for all 
t G T{S) and a G Red(5). Since an irreducible element 6 is always stuck in 
every neighbourhood of itself, it follows that t^{b) = 6 for all 6 G Irr(5). 

Now let ai,a2 G Red(5) be given and consider the matter of whether 
t^{ai + 02) = t^{ai) + t^{a2). Let e G O be arbitrary. By definition of t^ 
there exists some ti G T{S) such that ii(ai) is stuck in t^{ai) + e. Since 
02 G Per(S') there exists some t2 G T{S) and 62 G Irr(5) such that (t2oii)(«2) 
is stuck in 62 + e, and by unique reducibility of 02 it follows that 62 + e = 
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t^{a2) + £. Hence (t2 oii)(«i + 02) is stuck in t'^(ai) + t'^{a2) + £, and then by 
unique reducibility of ai+a2 it follows that t''^{ai+a2)+£ = t^{ai)+t^{a2)+£- 
Thus t^{ai + 02) = t^{ai) + t^{a2) by the arbitrariness of e. 

Next consider the matter of whether a continuous group homomorphism 
r: A4 — > A4 that commutes with all reductions will map Red(S') into itself 
and commute with . Let a G Red(S') be given and e € O he arbitrary. Let 
b £ Irr(5) and ti € T{S) such that ti (^(a)) is stuck in 6 + e be arbitrary. By 
the continuity of r there exists some 5 € O such that r{6) C e; let t2 G T{S) 
be such that t2(ti(a)) is stuck in t^{a) + 6. Since 

b - r(t^(a)) = b-{t2otio r){a) + r{{t2 o ti){a) - t^{a)) G e + r{6) = e 

it follows that r(a) is e-uniquely reducible, and by the arbitrariness of e 
that it is uniquely reducible. It is furthermore persistently reducible by 
Lemma l4.3t and hence an element of Red(5'). Finally t'^(r(a)) = r(t^{a)^ 
since it was in neighbourhoods of r(t'^{a)^ that images of r(a) could get 
stuck. 

Last, it should be verified that kert"^ C 2{S). Let a G keri'^ be given. 
Let e G O be arbitrary. There exists some t G T(S) such that t{a) is stuck 
in t^{a) + £ = £, and hence a — t{a) G a — e on one hand and a — t{a) G 
{b — t{b) I 6 G } C I{S) on the other, i.e., a is a limit point of 1{S). 
Since T{S) is topologically closed by definition, a G 2^(5*). □ 

A third property that Pere(5) and Rede(/S') should possess is to be topo- 
logically closed, but this does not happen automatically. A minimal example 
of a situation where Per£(S') is not topologically closed can be constructed on 
the formal power series foundation M. = Z[a], Ad = Z[a], and Bn = a"A^, 
if one picks as Ti[S) = {tnlJ^Li where 

I a™ otherwise 

for all n ^ 1 and m ^ 0. The problematic trait of this set of reductions is 
that (ti o . . . o t„)(a") = 1 ^ Bi for any n ^ 1, although the initial a" G Bn- 
This means no proper series a G \ is ever stuck in any set of the form 
6 + e, and consequently no such element can ever be persistently reducible. 
It follows that in this case Red(S') = Per(S') = Al, which is rather small 
compared to the closure M. 

On a conceptual level, what breaks down in this example is the principle 
that series truncation produces a useful approximation. Truncation works 
for a fixed reduction t G T{S) — in order to determine t{a) up to a certain 
number n of terms (i.e., in order to identify t{a) + B^) it is sufficient to 
determine t{b) where b is truncated to some number m of terms; this is the 
claim that t is continuous — but it may fail when the reduction is not fixed. 
What one would want is therefore a bound m on the number of terms that 
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must be taken into account that works for all reductions, and as it happens 
the property that such a bound exists has a name that is well known in 
analysis. 

Definition 4.8. A set F of group homomorphisms A4 — > M is said to be 
equicontinuous if there for every e £ O exists some 6 (z O such that for 
all / G F it holds that f{6) C e. 

Returning to the example, one may observe that Ti{S) = {tn}^=i actu- 
ally is equicontinuous (for e = i?„ take 6 = Bn+i), but what matters is that 
T{S) is not: no matter how small an a™" may be, there is always a com- 
position of simple reductions that magnifies it to something outside every 
e E C On the contrapositive side, when T{S) is equicontinuous then all the 
sets defined in this section become topologically closed. 

Lemma 4.9. IfT{S) is equicontinuous then t^ : Red(5') — > Irr(5) is con- 
tinuous and furthermore the sefePer(S), Per£(5), Red(5'), andKedc{S) are, 
for all e (z O, topologically closed in A4. 

Proof. Since by Lemma [4.71 is a group homomorphism, it suffices to show 
that it is continuous at 0. Let e £ O he arbitrary. Let 5 G O be such that 
t{6) C £ for all t G T(S); in other words every a G (5 is stuck in e. Any 
a G Red(S) fl 6 thus satisfies t^{a) G e, and hence t^ is continuous at 0. 

Now let e G O be given, and let 5 G O be such that t(6) C e for 
all t G T(S). Let a G A4 be an arbitrary limit point of Per£(S'), and let 
c G Pere(5) be such that c — a G 6. Let ti G T{S) be arbitrary. By 
persistent reducibility of c there exists some t2 G T(S) and b G Irr(S') 
such that {t2 o ti)(c) is stuck in b + e, i.e., (ts o t2 o ti){c) G b + e for all 
ts G T{S). By equicontinuity (t^ o t2 o ii)(c) — (is ot2 oti){a) G e and hence 
(t3 o t2 o ti){a) G b + £ as well, which means {t2 o ti){a) is stuck in 6 -|- e. 
By the arbitrariness of ti it follows that a G Pere(5), and hence that set 
must be topologically closed. Per(5) is thus known to be the intersection of 
a family of topologically closed sets, which implies that it too is closed. 

To show that Red£(5) is topologically closed, let a G Per(S') be a limit 
point of Rede (5). Let ti,t2 G T{S) and 5i, 62 G Irr(S') be arbitrary such that 
ti{a) is stuck in bi + e for i = 1,2. Let c G Rede(S') be such that c — a € 6. 
Since t(c) — t{a) G £ for all t G T(S), it follows that tj(c) is also stuck in 
bi + £ ioT i = 1,2, and hence 61 — 62 G e by the e- unique reducibility of c, 
whence a is e-uniquely reducible. All limit points of Rede(S') are in Per(S') 
and thus Red£(5') is topologically closed. Red(S') is similarly now known 
to be the intersection of a family of topologically closed sets, which implies 
that it is closed as well. □ 

Prom an analytical perspective, the effect of equicontinuity of T{S) is 
rather drastic — Irr(5) becomes sticky, in the sense that any a G Irr(S') -|- 6 
is stuck in the corresponding Irr(S') +£ (and even in a+£) — so in view of the 
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ruggedness of the proof of Lemma 14.91 oris might wonder whether equicon- 
tmuity really is The Right Condition for reaching the end that Red(5) is 
closed, but the jury is still out on that one. Looking at the proofs certainly 
suggests that it should be possible to make do with something weaker, but 
concrete applications rather tend to end up satisfying the stronger condition 
that t(e) C e for all e G O and t € T[S). Right now, the best reason for 
using equicontinuity is probably that it is well established and fully general; 
many other conditions which at first may seem to give finer control or be 
easier to verify are only defined with respect to some additional structure, 
such as a metric. 

The next lemma is the first step towards the Diamond Lemma. In nat- 
ural language, the first of the two equivalent claims is that all elements of 
M. are uniquely reducible, whereas the second claim is that every element 
has a unique normal form. 

Lemma 4.10. IfT{S) is equicontinuous and all elements of Ai are persis- 
tently reducible then the following claims are equivalent: 

• Red(5) = M. 

• M=lrr{S)®I{S). 

Proof. First assume Red(S') = A4. By Lemma HTTI t^ is a projection of A4 
onto Irr(5'), hence M = Irr(5) ©kert"^. By the same lemma, kert'^ C T{S). 
Hence M. = Irr(S') ®T(5') will follow if it can be shown that T{S) C kert"^. 
For any t G T{S) and a (z AAii follows from Lemma [4.7l that t'^(a — t(a)) = 
and hence { o — t(o) | a G 7W | C kert'^ for any t G T(S). Since kert"^ is 
closed under addition. 



and since it by Lemma 14.91 also is topologically closed, the wanted I{S) C 
kert'^ has been established. This has proved one half of the equivalence. 

For the other half, assume M = Irr(5) ®I{S). Let a G and e G O be 
arbitrary; it will be shown that a is e-uniquely reducible. Let ti,t2 £ T{S) 
and 6i, 62 G Irr(S') such that tj(a) is stuck in 6j + e for i = 1, 2 be arbitrary. 
By Lemma Upland the persistent reducibility of ii(a) and t2{a), there exist 
ci,C2 G Irr(5) such that there for every 5 € O exist ui,U2 G T{S) such that 
Ui[ti{a)) is stuck in Ci + 6 for i = 1,2. Clearly ui[ti{a)) — U2{t2{a)) G I{S), 
and since such ui and U2 exist for all 5 G O it follows that the limit ci — C2 G 
I{S) as well, but since also ci — C2 G Irr(S') and X(S') nlrr(S') = {0} it just so 
happens that ci = C2- For either 2 = 1,2 this common value is a limit point 
of a sequence of elements that are stuck in the topologically closed set bi + e, 
and hence ci — 6j G e. Therefore 61 — 62 S e and the two neighbourhoods 
are the same. By the arbitrariness of 61 and 621 the element a is e-uniquely 
reducible. □ 




teT(s) 
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The final lemma in this section explores a slightly different aspect of the 
machinery: how the sets change if some simple reductions are removed. It 
sometimes happens when one is preparing a presentation of an argument 
involving the diamond lemma that some of the reductions turn out to be 
redundant, but not all ways of verifying this redundancy are as easy as they 
may seem. The key condition for establishing equivalence of Ti{S) to Ti(S') 
— i.e., that all things constructed from the set of simple reductions are the 
same when the set of simple reductions is Ti(S") as when it is Ti{S) — is that 
their respective sets Irr(S') and Irr(S") of irreducible elements are the same; 
see Theorem 15.61 for a method of characterising the irreducible elements. 

Lemma 4.11. If Ti{S') C Ti{S) are such that Irr(S") = Irr(S'), Per(5") = 
M., Red(iS') = M., and T{S) is equicontinuous, then Red(5") = M. as well 
and t^' = t^ . 

Proof. Let a G and e € O be arbitrary. What needs to be shown is 
that a is e-uniquely reducible under T{S'). Therefore let 61, 62 € Irr(S") and 
ti,t2 € r(S") be such that ii(o) is stuck in 61 + e under T{S') and t2{a) is 
stuck in 62 + e under T{S'). Let 6 £ O be such that t{6) C e for all t £ T{S). 
By persistent (^-reducibility of a under T{S') there exist t[,t'2 G T{S') and 
b[,b'2 e Irr(5') such that t[{ti{a)) G b[ + S and i2(t2(a)) G ^2 + S. By 
equicontinuity this implies that t[ (ti(a)) and t2 (i2(«)) are stuck under T{S) 
in b[ +e and 62 +^ respectively. By e-unique reducibility under T{S) of a this 
implies that b[ — b'2 £ e. Furthermore bi — b[ = bi — t • (ti(a)) + 1 • (ti(a)) —b[£ 
e + (5 C e for i = 1,2 and thus 61 — 62 G e as well. Hence a is indeed 
e-uniquely reducible under T{S'), and it follows that Red (5') = Ai. 

Now let a G be given and consider the matter of whether t^ (a) = 
t^{a). Let e £ O he arbitrary and let 5 G O be such that t{6) Q e for all t G 
T{S). There exists some t G T(5") C T{S) such that i(a) G i^'(a) + 5, hence 
t{a) is stuck in t^ (a) + e under T{S), and consequently t^{a) G t^ (a) + e. 
It follows from the arbitrariness of e that t^{a) = t^ (a) . □ 

5 The core theorem 

At the heart of every diamond lemma lies an induction, and the role that y 
will play is as the domain of that induction, to which end it is necessary to 
order y. For many novices, the need to systematically order the monomials 
is by far the most unfamiliar aspect of working with the diamond lemma 
(and/or Grobner basis theory), and the problem of constructing a suitable 
order can be quite baffling. While this is not the place to give advice on 
how to attack that problem — see instead [7] for tips on this, in particular 
for issues regarding how the order interacts with the topology — it will still 
become necessary to reason about orders and their relations to other struc- 
tures. For that end, it helps to introduce a bit of notation for order relations. 
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which will facilitate discussions that simultaneously involve several orders. 
The aim is to allow the order to be an ordinary mathematical letter P (or 
more generally an expression), rather than a fancy symbol like y. 

The basic claim one can make (with respect to an order relation P) 
about a pair (//, u) of elements is that they are related by this relation. The 
usual formal interpretation of this is that (p,, u) G P, but notationally it is 
more convenient to write something like '/i ^ in P', to clarify that this 
is a non-strict inequality and that ji is on the "small side" . Using that one 
then defines 

u ^ ^in P, 

fu. ^ u in P and v ^ /j, in P, 
ly < fiin P, 

^ ^ 1/ in P and ^ n in P. 

If P is a partial order then /i i/ in P is the same thing as /i = i^, but if 
P is a more general quasi-order then this need not be the case. It is often 
convenient to construct a complex partial order by a sequence of refinements 
of some simpler quasi-order. Not every partial order will usefully support 
inductions however, so an additional property is needed. 

Definition 5.1. A binary relation P on 3^ is said to satisfy the topological 
descending chain condition (or to be TDCC for short) if lim„^oo = 
for every infinite sequence {//„}^o ^ y such that > Hn+i in P for all 
n G N. 

An informal phrasing of the ordinary descending chain condition (DCC) 
is 'there is no infinite strictly descending chain', although it is important to 
observe that 'descending' here implicitly requires that the chain elements are 
indexed. An index-free formulation of the DCC is 'every nonempty subset 
has a minimal element', and when this definition is given one usually speaks 
about the order being well-founded (which is thus a synonym of DCC). Note 
that asking for a minimal element is weaker than asking for a minimum 
element; the latter would give rise to a well-order, which in particular is 
always a total order. 

The next lemma gives the precise form of an induction over y-, condi- 
tion (i) provides the induction base, whereas the verification of condition (ii) 
is the induction step. 

Lemma 5.2. Let P be a partial order on y that is TDCC. If Z cy is such 
that: 

(i) there exists an e E O such that e oy C. Z , and 

(ii) if jj ^ y is such that {uey\u<ij, in P}CZ then jj, G Z; 



H ^ u in P 
H < u in P 
^ > u in P 
u ly in P 
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then Z = y. 



Proof. Let Z be an arbitrary proper subset of y which satisfies (ii); it wiU 
be shown that Z does not satisfy (i). To see this, let ^ y \ Z. For any 
IJ-n £ y\Z there must exist a fin+i € y\Z such that fj-n+i < fJ-n in P, because 
if that was not the case then all v y which satisfy v < P would also 
satisfy ly & Z, and hence by (ii) //„ G Z, which would be a contradiction. 
Thus there exists an infinite P-descending sequence {nn}^=i y\ Z, and 
hence by TDCC lim„^oo = 0. In other words there exists for each e S O 
an integer such that /x„ € e for all n ^ N, and thus e ny 3 fiN ^ Z. 
Hence Z does not satisfy (i). □ 

For such inductions to be useful in the present context, it is however 
necessary that the reductions used comply with the order. 

Definition 5.3. If P is a binary relation on y and n € y then DSM(^, P) 
denotes the least topologically closed -R-module of M which contains all 
v € y such that u < fiin P, i.e., 

DSM(/i, P) = Cspan({ u e y \ u < fi in P}) . (5.1) 

The set DSM(//, P) is called the down-set module of with respect to P. 

A reduction t € T(S) is said to be compatible with the relation P if 
t(/x) G {fj,} U DSM(/x, P) for all € y. A set of reductions is said to be 
compatible with P if all its elements are compatible with P. 

An element g M is said to be P-monic if there exists some fi & y 
such that g — fi & DSM(;U, P). A subset of ^A is said to be P-monic if all 
elements in it are P-monic. 

Down-set is a term from poset theory, but the standard name there for 
this concept is ideal rather than down-set. That terminology has however 
been avoided so that no confusion with the ring-theoretic ideal concept will 
arise. 

That an arbitrary map A4 — > M should be compatible with some 
order relation P is a rather strong condition, but for reductions it is often 
something which comes naturally. For a simple reduction t^i-^a satisfying 
(|3.ip . it boils down to the condition that a € DSM(/x, P), and when that is 
the case then /i — a is P-monic. Conversely, if (jS.ip for any (^u, o) G 3^ x 
A4 defines a continuous homomorphism t^n^a that commutes with elements 
of R, then every P-monic g G ^A gives rise to some map tf^^^-g that is 
compatible with P. Grobner basis theory preaches an extreme form of this, 
where the normal state of things is that the leading monomial /j, is split 
off from a basis element g every time something is to be reduced modulo g 
(although it is recognised that caching ^ with g can improve performance). 
This approach is facilitated by the fact that Grobner basis theory normally 
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only considers total orders, as that guarantees that there always is a unique 
leading monomial to split off. 

In more general cases, compatibility is often something one arrives at 
indirectly. When preparing to apply the diamond lemma, one often starts 
with some reductions t^^a that one wants to use, and faces the task of 
constructing some P with which these would be compatible. (This P will 
also have to satisfy some other conditions, in particular the TDCC and in 
most cases some variant of ()6.4p . which constrains the possibilities quite a 
lot.) If the given reductions do not generate all the wanted congruences, 
then the next step is to find a set of P-monic generators which cover the 
rest, and then make additional simple reductions from these. At each step 
one's choices are restricted by the need to ensure compatibility further on, 
but when the set-up is complete it is usually a trivial matter to verify the 
compatibility of simple reductions. The next lemma then extends this result 
to general reductions. 

Lemma 5.4. If P is a partial order on y with which t £ T(S) is compatible, 
then for all e y and b G DSM(//,P) it holds that t{b) G DSM(//,P). If P 
is a partial order on y with which ti,t2 € T(S) are compatible, then t2 o ti 
is compatible with P as well. If Ti{S) is compatible with a partial order P 
on y then the whole ofT{S) is compatible with P. 

Proof. For the first claim, consider first some special 6, and then generalise 
following the characterisation (15. ip of DSM(/i,P). li b = v £ y satisfies 
1/ < /i in P then DSM(i/, P) C DSM(/i, P) and consequently t{v) G {u} U 
DSM(i/, P) C DSM(;U, P) as claimed. If r G P* is any finite composition of 
elements of R then t{r{u)) = r{t{v)) G r(DSM(//,P)) C DSM(//,P), thus 
extending the result to b on the form r{u). If 61,62 G DSM(/i, P) are such 
that t{bi),t{b2) G DSM(//,P) then clearly t{bi - 62) G DSM(/x,P) as well, 
and since this establishes that the set of 6 for which the result holds is a group 
it follows that the result holds for arbitrary 6 G Span({ zvG3^|z^</UinP}). 
Finally if {6„}^^i C DSM(/i,P) are such that t(6„) G DSM(/i,P) then 
t{b) G DSM(^, P) also for b = lim„,^oo 6„ by continuity, and thus the result 
holds for all b G DSM(//,P). 

For the second claim, let G 3^ be arbitrary. If = /x then 

(^2 o ti){fj,) = t2{fJ,) G {/Li} U DSM(;U,P) by the compatibility of t2 with 
P. Otherwise G DSM(/x,P) and thus (t2 oii)(yu) G DSM(/i,P) by the 
first claim. Hence t2 o ti is compatible with P. The third claim immedi- 
ately follows from the second and the observation that the identity map id 
is compatible with all relations. □ 

An intuitive picture which might be useful is to think of the down-set 
module of /i as a sort of cone with fi at the apex. This picture is deceiving 
insofar as it represents entire P-modules of the form Span({/i}) as single 
points and does not even begin to consider the topological structure, but it is 
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nonetheless very much to the point. In that picture, one might interpret the 
above lemma as saying compatible reductions cannot map elements inside 
a cone to elements outside it. This is similar to how the sets FeVsiS) of 
persistently e-reducible elements behave with respect to reductions, and 
indeed the next lemma makes use of down-set modules in showing that 
Per£(S') is the whole of A4. 

Lemma 5.5. Assume T{S) is equicontinuous and compatible with some par- 
tial order P on y. If P satisfies the topological descending chain condition 
then Per(S') = M. 

Proof. Let e € O be arbitrary. Let Z be the set of all elements of y which 
are persistently e-reducible. By equicontinuity there exists some 6 € O such 
that t{6) C e for all t € T(S). Hence all fi £ y n6 belong to Z, since these 
satisfy E e for all t E T{S). These elements constitute the base for the 
induction, fulfilling condition (i) of Lemma 15.21 

For the induction step, consider some arbitrary ^ £ y. Assume that all 
I' y such that z/ < /i in P satisfy i/ E Z; it will now be shown that this 
implies /j. € Z. To that end, let ti E T{S) be given and try to find some 
t2 E T{S) and a E Irr(5) such that i2(^i(A*)) is stuck in a + e. It is useful 
to observe that the induction hypothesis, by Lemmas 14.31 and 14.91 implies 
DSM(/i,P) C FeieiS). 

Depending on /i and ti, there are three cases that can occur. If E Irr(S') 
then ti{ii) = fi is stuck in + e and hence /j, G Z. If ti(/u) 7^ /i (and 
hence n ^ Irr(S')) then by compatibility of ti with P it follows that ti(//) E 
DSM(//, P) C Per£(S') and consequently by this persistent e-reducibility of 
ti{^) there exist a E Irr(S') and t2 E T{S) such that i2(ii(;^)) is stuck in 
a + e. Finally, if /u ^ Irr(S') but ti(/i) = /i then there still exists some 
t2 E T{S) such that t2(/i) / fj. and thus t'^ifi) E DSM(^,P) C Per£(S'). As 
before there now exist ^ T{S) and a E Irr(S') such that t2(^2(/^)) is stuck 
in a + e, whence for t2 = t2 o t2 one finds that t2{ti{fj,)) is stuck in a + e. 
Either way, fi Z hy the arbitrariness of ti, which completes the induction 
step. 

All conditions for Lemma 15.21 are now fulfilled and hence Z = y. By 
Lemmas 14.31 and Lemma l4.9| Pere(S') = A4. By the arbitrariness of e, it 
then follows that Per(S') = f]^^^ Pere(5) = M as well. □ 

The same conditions also suffice for giving an explicit description of 
Irr(5'). It is not unusual that one can quickly establish this result also 
through more elementary arguments, but for complicated set-ups it is conve- 
nient to have a proof relying on (a subset of) the conditions of Theorem l5.111 

Theorem 5.6. Assume T{S) is equicontinuous and compatible with some 
partial order P on y. If P satisfies the topological descending chain condi- 
tion then 

Irr(5) = Cspan({ e y \ t{fi) = fi for all t E Ti(5) }) . (5.2) 
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Proof. Let W be the set of irreducible elements of y. It follows from 
Lemma 13.51 that the left hand side Irr(S') of (j5.2p contains the right hand 
side Cspan(VF). The reverse inclusion will be established by demonstrating 
that 1tt{S) ^Span(Ty) + e for all e € O. 

Let £ e d be given. Let 5 € O be such that t{6) C e for all t £ T{S). 
Let be the set of those a € such that there for every ti € T{S) exists 
some t2 € T{S) such that for every G T{S) it holds that (ts o t2 o ii)(a) S 
Span(Ty) + £. It will now be shown that N = A4. 

To see that A'' is a subgroup of Ai, let ai, a2 G N he given. Let ti £ T{S) 
be arbitrary. There exists some ^2 G T{S) such that {t-s 0^2° G 
Span(VF) + £ for every G T{S). There also exists some £ T{S) such 
that {ts o o [t2 o ii))(a2) € Span(VF) + e for every G T{S). Hence 
(ts o (t4 o t2) o ti) (ai - 02) = ((is o ^4) 0*2° h) (ai) - {ts 0*4° (t2 o ti)) (02) G 
Span(VF) +e for every ^3 G T{S), and thus ai — 02 G by the arbitrariness 
of ti. Elements of R map N into itself because they commute with all 
reductions and map Span(l^) + e into itself, hence A'' is an i?-module. A'' is 
topologically closed because any t G T{S) maps N + 6 into t{N) + £; if a £ N 
then there is some a' £ N Ci (a + 6), hence for any ti £ T{S) there exists 
some t2 £ T{S) such that (is 0*20 ti){a') £ Span(VF) + e for all £ T{S), 
and thus {t^ 0*2° ti)(a) G (is 0^2° ti){a') + e C Span(l^) + £ + £. 

The proof that 3^ C A'" is done by induction. As usual, J' fl (5 C 5 C A^ 
since t{6) C + e for all t G T{S). For the induction step, let ^ G 3^ 
be arbitrary and assume v £ N for all G 3^ such that u < ji m. P. If 
jjL £ W then t{ij) £ W for all t £ T{S) and hence ^ £ N. Otherwise 
let ti £ T{S) he given. If ti(^) = /U then let t2 £ Ti{S) be such that 
^2(/^) 7^ otherwise let ^2 = id. Since (t'^ o ti){p) 7^ /i it follows from 
compatibility, the induction hypothesis, and the previous paragraph that 

o ti){fi) £ DSM(/i, P) C A^. Hence there exists some t2 £ T{S) such that 
any ^3 G T{S) satisfies {t-s o t2 o t'2 o ti){fi) £ Span(l^) + e and thus n £ N. 
Since P is TDCC, the conclusions 3^ C A^ and N = M follow. 

Finally, let a £ Irr(5') be arbitrary. Since a £ N there exists some 
t £ T{S) such that t{a) £ Span(VF) + £, but t{a) = a by irreducibility. 
Hence Irr(5) C Span(T^) + e by the arbitrariness of a, and it follows that 
Irr(S) C n^gg(Span(iy) + e) = Cspan(VF). □ 

The Red(5) = Ai counterpart of Lemma [531 is Lemma [5^ but unique 
reducibility requires another condition, wherein the following definition is 
handy. 

Definition 5.7. Let P be a binary relation on 3^. Then for any fi £ y, 

define 

DIS(/i,P,5) = Cspan(^{iy -t{u) | 1/ < /xinP,t G Ti{S) }). (5.3) 

Also write a = b (mod S < fi in P) (read "a is congruent to b mod S less ^ 
in P") as a shorthand for a — b £ DIS(^, P, S). 
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The etymology of this DIS notation is "Down-set T{S) Section", even 
though one cannot in general interpret DIS(/x, P, 5) as being synonymous 
to DSM(/i,P) n It is clear that DIS(/u,P,5) C I{S), and if T{S) 

is compatible with P then DIS(//, P, S) C DSM(/u, P) by Lemma [531 but 
for equality with the intersection to hold one pretty much have to fulfil the 
conditions of Theorem 15.111 It is however not so interesting to exactly map 
the extent of this module; one rather seeks to prove that particular elements 
of Ai belong to it by exhibiting explicit expressions for them. Calculations 
are often convenient to express in the form a = ai = • • • = = (mod S < 
Hin P). 

Lemma 5.8. Let P be a partial order on y. Assume Ti{S) is such that for 
all fj: & y and simple reductions ti,t2 G Ti{S) that act nontrivially on ^ it 
holds that 

ti{fi)-t2{f^)eBlS{fi,P,S). (5.4) 

If furthermore P satisfies the topological descending chain condition and 
T{S) is equicontinuous and compatible with P, then Red(S') = M. 

Proof. It will be shown by induction over y that Rede(5') = M. for all e G O. 
Observe that Per(S') = Aihy Lemma [5. 5 1 hence it is sufficient to prove that 
all elements of M. are e-uniquely reducible. Let e G O be given and let 
(5 G O be such that t{5) C e for all t G T{S). The induction hypothesis is 
that the fj. £ y under consideration satisfies fi G Rede(S'). The induction 
hypothesis clearly holds for all /u G 3^ such that n G 6, since t{fj,) G e for all 
such fi and all t G T{S). This has laid the base for the induction. 

For the induction step, assume that u G Red£(5) for all G 3^ such that 
v < ^ in P, and consider fi. By Lemmas 14.51 and 14.91 this assumption is 
equivalent to DSM(;^,P) C Rede(S'). Let ti,t2 G T{S) and 61,62 G Irr(5) 
be arbitrary elements such that ti{iJ,) is stuck in 61 + e and t2(^) is stuck in 
62 + £• The problem now is to show that bi+e = b2 + S- A trivial case occurs 
if ti or t2 acts trivially on fj,; it can be assumed without loss of generality 
that t2 acts trivially. In this case n = t2{^i) is already known to be stuck in 
62 + and hence ti(^) is stuck there as well. 

With that taken care of, it can be assumed that ti and t2 both act 
nontrivially on /i. Thus there exist iia,i2a G Ti{S) and tii,,t2b G T{S) such 

that: ti(/i) = tib{tiaifJ-)) , t2{n) = t2b{t2a{ll)) , hain) / ^ and t2a(^) / 

By persistent reducibility there also exists some t^ G T{S) and 63 G Irr(S') 
such that tsitiaifi) — i2a(^)) is stuck in 63 +e. By e-unique reducibility of 
tiaifJ-) there exists some tic G T{S) such that {ticOt3otia){n) is stuck in bi+e, 
and similarly there exists some t2c G T{S) such that (t2c ° he 0^3° i2a)(/u) is 
stuck in 62 + Let t^ = t2c ° he- Since 

61 - 62 = (61 - (t4 0*30 tia){fl)) + 

+ (^4 O t3)(ila(Ai) - i2a(^)) + ((^4 0*3° i2a)(^) " 62) G 

G £ + (63 + e) + e. 
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it would follow that bi + e = b2 + £ £ e. 

By assumption iia(^) — t2a(Ai) E DIS(^, P, 5). Thus there exist {/Oi}"^;^ C 
3^, reductions {ui}"^^-^ C Ti{S), and G Span({pj}) for each i = I, . . . ,n, 
such that Pi < in P for i = 1, . . . , n and 

n 

(ila(/^) - t2a(/i)) - - '"i(ai)) e 

1=1 

Now the idea is to construct and consider a reduction that takes each term 
of this expression to an e-neighbourhood of its normal form. Let wq = ts 
and for each i = 1, . . . ,n: let Vi,v^ £ T{S) and Cj, € Irr(5) be such that 
Vi(^Wi-i{ai)^ is stuck in some Cj + e and ^^((t'i o Wi-i o ?Xj)(ai)) is stuck in 
c^ + e, then define = v'-oviOWi^i. By e-unique reducibility of Oj it follows 
that Cj + e = + e and thus Wn{ai — Ui{ai)) G e. Hence 

^i=l 

n 

= ^ Wn(ai - Ui{ai)) + 'u;„((5) C e. 

i=l 

Since also Wn{tia{p) — t2a(/^)) G fts + it follows that 63 G e and 61 + e = 
62 + This completes the induction step. 

By Lemma 15.21 the induction hypothesis holds for all G 3^, and hence 
y C Rede(S'), which implies Red£(5) = 7W. Since e was arbitrary, Red(5) = 
n£>o Rede (5) = Mas weh. □ 

The next definition helps simplify the main condition (j5.4|) of Lemma [5.8l 
to "assume all ambiguities of Ti{S) are resolvable relative to P", which is 
one of the main equivalent conditions in the diamond lemma. 

Definition 5.9. An ambiguity of Ti{S) is a triplet (ti, /i, ^2)1 where ti, t2 G 
Ti{S) act nontrivially on /j, € y; the ambiguities ^2) and {t2,n,ti) are 

considered equivalent. An ambiguity (ti,;U,t2) is said to be resolvable if 
there, for every e € O, exists reductions t3,t4 G T{S) such that t^ltidj,)^ — 
i4{t2{p.)) G e. An ambiguity ^2) is said to be resolvable relative to 

a binary relation P on 3^ if ti{fi) — t2{fJ-) G DIS(/i, P, 5). 

The essential content of the ambiguity concept has been given a bewil- 
dering variety of names, where 'ambiguity' is that used by Bergman [2]. 
The most common term is rather critical pair, but there appears to be no 
consensus on what the elements of the critical pair are. Baader-Nipkow [U 
Def. 6.2.1] effectively defines a critical pair to be some t2(^)) and 

informally speaks of the ambiguity (ii,/i,t2) from which it came as a fork. 
This critical pair terminology would make sense within the present frame- 
work, but it cannot completely replace ambiguities, as there is not enough 
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information in the critical pair to define relative resolvability. In contrast, 
the definition of 'critical pair' in Grobner basis theory (a pair of Grobner 
basis elements) is technically quite different and cannot be stated in the 
generic framework, although the essential content is still the same. 

The second most common name is probably 'overlap', but although over- 
lap ambiguities are by far the most important ones, there are also important 
ambiguities which aren't overlaps; the taxonomy of ambiguities is a subject 
of Section[6l Rarer names still are 'composition' (Shirshov [14J and Bokut [3], 
hence the alternative name composition lemma for the diamond lemma) and 
'superposition' (Knuth-Bendix [9j), both of which refer primarily to the ^ 
part of an ambiguity (ti,;U,t2)- 

It should also be pointed out that many of the above concepts presume a 
certain minimality — the critical of 'critical pair' refers to that these are the 
ones that really need to be checked — whereas the above ambiguity concept 
has no such restriction. This is because the mechanisms traditionally used 
to discard some ambiguities as redundant rely on structures not apparent 
in the basic framework (^A4, R,y,0,Ti{S)^ , and therefore not available in 
this generality. Corresponding results for the present setting can be found 
in Section [6l 

It is common to say that a rewriting system S is confluent if everything 
has a unique normal form, but since this by Theorem lS.lll is equivalent to a 
number of quite different conditions, one shouldn't be surprised if different 
authors define it differently. Taking [T] as authority, where confluence is 
defined for reduction relations, one may call Ti{S) locally confluent if all 
ambiguities of Ti{S) are resolvable; this adjusts the traditional definition 
to allow for topology and take advantage of the i?-module structure, but is 
otherwise a strict interpretation. Global confiuence is, assuming persistent 
reducibility, more directly equivalent to unique reducibility: Ti{S) is globally 
confluent if there for every fi £ y, all ti,t2 G T{S) {not only simple reduc- 
tions), and every e e O exist t3,t4 E T{S) such that t3(ti(/i)) —t4(t2(Ai)) € e. 
The two may seem similar, but a proof that local confiuence implies global 
confiuence (which essentially is what the original diamond lemma of New- 
man [12] was all about) requires something like an induction over y to go 
through. 

As will become clear in the next section, relative resolvability is more 
important in the theoretical machinery than plain resolvability, since it more 
easily lends itself to reasoning about elements of \3^. On the other hand, 
plain resolvability is usually a more natural goal to aim for in practical 
calculations. The next lemma says that it is a sufficient condition also for 
relative resolvability. 
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Lemma 5.10. IfT{S) is compatible with the partial order P on y, then 



DIS(/z,P,5)= {a-t{a)\a(£BSM{n,P)} (5.5) 

teTiiS) 

= ^ { a - t{a) I a G DSM(/x, P) } (5.6) 

teTiS) 

for all £ y and each resolvable ambiguity ofTi{S) is also resolvable relative 
to P. 

Proof. Let fj, € y he given. It is clear that (|5.5p and (|5.6p hold with C 
inclusions, so what needs to be shown are the 2 inclusions. In ()5.6p . one 
may observe that any t E T{S) \ {id} decomposes as t = t^ o ■ ■ ■ o ti for 
tl, . . . ,t„ G Ti{S) and that a — t{a) = Y17=i{^i ~ ai = a and 

Oj+i = {ti o ■ ■ ■ o ti){a) for i = 1, . . . , n — 1. By Lemma [5^ Oj G DSM(/i, P) 
for i = 1, . . . , n, and hence 

{ a - t{a) I a G DSM(/i, P), t G T(5) } C ^ { a - t(a) | a G DSM(/i, P) } . 

ieTi(5) 

In (j5.5p . one must instead decompose the elements of DSM(/x, P). Let t G 
ri(5) and a G DSM(;U,P) be arbitrary. Let e G O be arbitrary. Let 5 G O 
be such that 6 e and t{6) C e. By Lemma [2^81 there exist i^i, . . . , f„ G 3^ 
and ri, . . . , r„ G ibP* such that z/j < in P for i = 1, . . . , n and 6 : = 
SiLi '^ii'^i) ^ CL + 6. Clearly 

n / n \ n 

i=l ^t=l ^ i=l 

and since (6 — 1(&)) — (a — t(a)) G e it follows, by the arbitrariness of e, that 
a — t{a) is a limit point of DIS(/x, P, S). Since this set is topologically closed 
by definition, a — t(a) G DIS(;U, P, S) and hence 

^ { a - t(a) I a G DSM(/x, P) } C DIS(//, P, 5) 

teTi(5) 

by the arbitrariness of a and t. (|5.5p follows. 

Let (^1,//, ^2) be a resolvable ambiguity of Ti(5), let oi = ti{fi), and let 
02 = *2(a')- Let e G O be arbitrary. Let t^jt^ G ^(5") be such that ^3(01) — 
^4(02) G e. By Lemma [531 «i)«2 £ DSM(/x,P). Hence 6 = (ai — ^3(01)) + 
(—02 — t4(— 02)) G DIS(/i, P, 5) and (ai — 02) — b = t3(ai) — ^4(^2) £ £• In 
other words, ai — 02 is a limit point of DIS(/i, P, 5). As above, it follows 
that ai - a2 G DIS(/i,P,S'). □ 

With that final implication, the big equivalence in the generic diamond 
lemma is now apparent: 



40 



Theorem 5.11. // T{S) is equicontinuous and compatible with a partial 
order P on y that furthermore satisfies the topological descending chain 
condition, then the following claims are equivalent: 

(a) Every ambiguity ofTi{S) is resolvable. 

(a') Every ambiguity ofTi{S) is resolvable relative to P. 

(b) Every element of M is persistently and uniquely reducible, i.e., M = 
Red(5). 

(c) Every element of M has a unique normal form, i.e., A4 = Irr(S') © 
I{S). 

Proof. By Lemma 15. 5| Per(S') = A4. Hence (b) and (c) are equivalent 
by Lemma 14.101 (a) implies (a') by Lemma 15.101 and (a') implies (b) by 
Lemma 15.81 Hence the only thing left to prove is that (b) implies (a). 

Let an ambiguity {ti,fj,,t2) be given. Let e G O be arbitrary. Since 
H G Red(5) there exist ^3,^4 G T(S) such that t3(ti(/x)) and t4(t2(/u)) are 
stuck in t^{^) + e. Hence t^itidi)) — t4(t2(Ai)) G and thus the ambiguity 
is resolvable. □ 



6 Ambiguities 

Most applications of a diamond lemma has as one of its main steps the 
calculations for checking that the ambiguities are resolvable. In Grobner 
basis theory this is even more central, with various completion algorithms 
being driven by these ambiguity resolution calculations (and adding the twist 
of modifying the set of reductions whenever it is found that an ambiguity 
fails to resolve). From the theoretical foundations these algorithms above 
all else need some criteria for discarding ambiguities that don't need to be 
checked, as there typically are infinitely many triplets (t, ;U, u) which qualify 
as ambiguities under Definition 15.91 In order to formulate such criteria one 
needs some extra structure however, and one that performs very well is to 
have a family of advanceable maps. 

What makes advanceable maps useful for structuring a set of ambiguities 
is primarily that one ambiguity can be the image of another ambiguity; 
indeed, for e.g. the family (13. 8p of advanceable maps every image of an 
ambiguity is another ambiguity. Such images are however never more than 
shadows of the original ambiguities, since also the ambiguity resolutions 
can be transported to the image by the advanceable map. This argument 
(see Lemma 16.21 for the formal claim) is classically used to prove that it 
is only necessary to check resolvability for minimal ambiguities (since any 
non- minimal ambiguity would be a shadow of a smaller ambiguity), but 
it can also be used to prove that it is sufficient to check one labelling of 
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an expression, since relabelling maps are often advanceable. A catch is 
however that even if one can skip checking any particular shadow ambiguity, 
it does not necessarily follow that all shadow ambiguities can be skipped — 
the "original" of which an ambiguity is a shadow can itself be a shadow of 
the shadow (e.g. relabellings are typically invertible). The 'shadow-critical' 
concept of Definition 16.81 is one way around this catch, even though it in 
general doesn't discard everything that might be skipped. 

A major complication when considering shadow ambiguities is that the 
original ambiguity will often have a different sort — reside in a different 
base set — than the shadow that one wishes to resolve. Therefore it is 
in this section necessary to make the multiplicity in the basic framework 
[j\4, R,y,0,Ti{S)^ explicit, and think in terms of a family of such frame- 
works. Thus there is a set / (the set of "sorts") which serves as the index 
set for the family of frameworks, and for each i G I there is a quintuplet 
{M{i), R{i),y{i), 0{i),Ti {S){i)) where: 

• ^A{i) is a topological abelian group (and 7W(i) is its completion). 

• R{i) is a set of continuous group endomorphisms on and hence 
on M{i). 

• y{i) is a spanning subset of M.{i). 

• 0{i) = {Bn{i)}^^i is a family of i?(i)-modules (hence subsets of M-{i)) 
that constitutes a family of fundamental neighbourhoods of 0. 

• Ti{S){i) is a set of continuous group endomorphisms on M{i) that 
commute with elements of R{i). 

As with the five main pieces, 

• the particular partial order on y{i) will be denoted P{i), but all as- 
sumptions on this are explicit, like they were in the previous section. 

Other things defined from the basic framework are similarly specialised to 
a sort i by appending an '(i)' to the symbol; the parenthesis notation for 
indexing may seem a bit peculiar, but it is traditional for operads, which have 
inspired much of the multi-sorted extensions to this formalism. Operads 
have I = N with the index i being the arity of the elements concerned, so 
there is one set Irr(S')(0) of irreducible constants, another set Irr(S')(l) of 
irreducible unary operations, yet another set Irr(5')(2) of irreducible binary 
operations, etc.; other types of algebraic structures typically require other 
index sets. What happens with respect to ambiguities is that each Ti(S){i) 
has its own ambiguities, but it is frequently the case that they turn out to 
be shadows of ambiguities in some Ti{S){i'). 

The family V of maps that one wants to have advanceable reacts differ- 
ently to the introduction of several sorts: it acquires two sort indices, since 
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there is no reason the codomain should have the same sort as the do- 

main A4(i'). It is however not until Definition 16.81 that this has to be made 
explicit; before that it is sufficient to reason about specific advanceable maps 
that relate to specific ambiguities. An underlying set 5 of rewrite rules will 
typically not carry sort indices, as every element of it contributes to every 

Tiism. 

Definition 6.1. Let I be given. A map v: — > A4{i) is said 

to be advanceable with respect to Ti(5)(i') and Ti{S){i) if there for every 
t' G Ti{S){i') and a G R*y{i') exists some t G T{S){i) such that t{v{a)) = 
v(t'{a)^. The map v is said to be absolutely advanceable with respect to 
T{S){i') and T{S){i) if there for every t' G T{S){i') exists some t G T{S){i) 
such that V ot' = to V. 

An ambiguity {t,ii,u) of Ti{S){i) is said to be a shadow of the am- 
biguity {t',fi',u') of Ti{S){i') if there exists an advanceable continuous ho- 
momorphism v. A4{i') — > ^A{^) such that /i = v{fi'), t{fj,) = and 
= v[u'{fi')) . The ambiguity {t, /x, u) is said to be an absolute shadow 
if V is absolutely advanceable. 

In the classical case of Bergman's diamond lemma, there is only one sort 
— preferably denoted 1 for consistency with the operad generalisation — 
and hence the indices could be dropped. Not dropping indices, one would 
have M{1) = 7^(X), y{l) = X*, and ri(5)(l) being the set of all maps 
tuisu2 on the form (13. 5|) : an ambiguity is thus some (iAisii/u ^A2S2i^2) where 
A 1 /is ^1^1 = /i = A2//s2^2- However, if Ai and A2 have some common prefix 
K (i.e., Ai = kX[ and A2 = for some k, A'^,A2 G X*) and/or z^i and 1^2 
have some common suffix p (i.e., i^i = I'lP and V2 = v'2P for some 1/2, /O G 
X*) then for p! = )k^Ps^v\ = }^2Ps2^''i one finds that (tx^s^v^^ i^-.'^x^s^vi) is a 
shadow under the absolutely advanceable map v{\)) = nbp of the ambiguity 

(*A'jSi!^( ) /^'> ^A^S2;^2)' ^^'^^ unique factorisation in the free monoid X*, 
it follows that the only ambiguities that are not such shadows are those 
where at least one of Ai and A2, and at least one of i^i and 1^21 are equal to 
the identity 1. This, with the help of the next two lemmas, cuts down the 
number of ambiguities that explicitly need to be resolved quite considerably. 

Lemma 6.2. // an ambiguity is resolvable then all its absolute shadows are 
resolvable as well. 

Proof. Let the indices i,i' G I be given. Let {t[,p',t2) be a resolvable 
ambiguity of Ti{S){i'). Let {ti,p,t2) be an arbitrary ambiguity of Ti{S){i) 
that is an absolute shadow of {t'i,p',t2), and let v. M(i') — > A4{i) be 
the absolutely advanceable continuous homomorphism which links the two 
ambiguities. Let e G 0{i) be arbitrary, and let 6 G 0{i') be such that 
v{6) C e. Since {t[,p',t'2) is resolvable there exists ^3,^4 G T{S){i') such 
that t'^(t[{p')^ —t'^(t2{p')) G 5. By absolute advanceability of v there exists 
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^4 G T{S){i) such that o v = v o t'^ and t/^ o v = v o t'^. Then 



h{tl{^l)) -U{t2{l^)) ={t3 0ti){v{tl'))-{Uot2){v{tl')) = 
= {vot'^ot'^){^')-{vot'^oQ{^) = 

= v{it',ot[){f,')-{t',ot',){fi'))evid)Ce, 



and since e was arbitrary it fohows that is resolvable. 



□ 



In this lemma, it would not have been sufficient to assume conditional 
advanceability, and it is instructive to consider why. Suppose ti{n') is not 
a single element of R*y{i'), but is instead the sum A + of two different 
elements of y{i')- Suppose further that € Ti{S){i'), because the extension 
to non-simple reductions in absolute advanceability is not the main issue. 
If V is advanceable then there certainly exist t^Xjt^i, € T{S){i) such that 
t3\{v{X}) = v{t'2{\)) and t3u{v{iy)) = v{t'2{u)), but there is no guarantee 
that there is some € T{S){i) such that t^^v^X + v)) = v[t'^{X + u)). In 
very many cases it would probably turn out that something like t^x ° t^^ 
acts exactly as the ta one needs, because good choices of simple reductions 
tend to act trivially on large subsets of but since it cannot in general 

be assumed that t^i^ acts trivially on w(A), that composition will sometimes 



The idea to first reduce one term, and then the next, is basically good but 
requires some kind of book-keeping device to work. Provided that simple 
reductions are of the t^n^a kind (i.e., each only acts nontrivially on one 
element of 3^), a possibility would be to use the order on y{i) and start 
with the smallest terms, but a more powerful solution is to go for relative 
resolvability instead; this provides for reducing different terms independently 
of each other. The small price one has to pay is a condition on how the 
advanceable map behaves with respect to the partial orderings. 

Lemma 6.3. Assume the ambiguity (t,fi,u) ofTi{S){i) is a shadow of the 
ambiguity {t',fj,',u') of Ti{S){i'), that the corresponding advanceable map 
v. M{i') — > M{i) satisfies 



and that T{S){i) is compatible with the partial order P{i)- If {t',fj,',u') is 
resolvable relative to P{i'), then {t,fi,u) is resolvable relative to P{i). 

More generally, an advanceable continuous homomorphism v: M{i') — > 
M{i) satisfying (|6.1|) also maps DIS(/i', P(i'), 5) into DIS(/i, -P(i), S") if 
Ti{S){i) is compatible with the partial order P{i). 

Proof. Let D = DIS(/i, P(i), 5) and D' = mS{^i' ,P{i'),S). That {t,^i,u) 
is resolvable relative to P{i) is by definition that i(/i) — u{^) G D, and since 
t{ij) - uin) = v{t'{ii') - u'iii')) where t'(/i') - u'(/x') G D' , the first claim 
follows from the second: that v{D') C D. 



fail. 




(6.1) 
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Let a £ D' he given. Since D is topologically closed, it follows that 
v{a) € I? if it can be shown that v{a) € D+e for every e G 0{i). Let e G 0{i) 
be arbitrary. Let 6 G d{i') be such that v{S) C e. Let {i^j}f=i C 3;(i'), 
{rj}]Li ^ ±R*{i'), and C ri(5)(z') be such that 

m 

i=i 

and Vj < fi' in -P(i') for j = 1, . . . ,m. Let 6j = rj^vj) for j = 1, . . . , m and 

By advanceability of v there exist reductions {tj}^]^ C T{S){i) such 
that (tj o v){bj) = {vo t'j){bj) for j = 1, . . . , m. Since 6j G DSM(/i', P{i')) it 
follows that f G DSM(//, P{i)) for j = 1, . . . , m, and hence 

y m \ m 

<b) = v(Y.{b,-t'^{b,))\=Y,{vib,)-t,{v{b,)))eD 
S=i ^ i=i 

by Lemma 15.101 Furthermore v{a) — v{b) = v{a — 6) G v{5) C e, thus 
G D by the arbitrariness of e. □ 

The following concepts are useful when one seeks to prove that an ad- 
vanceable map (or family of advanceable maps) satisfies (j6.ip . 

Definition 6.4. A map v. M.{i') — > M.{i) is said to correlate P{i') to 
P{i) if 

v{ii)ey{i) =^ v(BSM{^^, P{i'))^ CBSM{v{fi),P{i)) (6.2) 

for all /i G y{i')- A map v: y{i') — 3^(i) is said to be monotone with 
respect to P{i') and P{i) if 

^ ;U in P{i') =^ v{iy) ^ ?;(^) in P{i) (6.3) 

for all yi^')- The map v is strictly monotone if 

ly < fiin P{i') =^ v{i') < v{fi) in P{i) (6.4) 

for all /U, 1/ G 3^(i')- 

It needs to be pointed out that there, particularly for functions with 
domain and codomain in M, exists a conflicting terminology which calls 
the property defined in (|6.3|) 'increasing' and instead defines 'monotone' 
as 'increasing or decreasing'; preferences vary. When 'monotone' as here 
means "preserves inequalities" then the corresponding name for "reverses 
inequalities" is antitone. Yet another name that might be seen for a map 
having properties like these is that it is 'compatible' with the order, but here 
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Definition 15.31 has already given that name to a different relation between 
maps and binary relations. 

The applied concept in this trio is that of a map which correlates -P(z') 
to -P(i): it covers the condition (j6.ip of Lemma [6.3l and it blends nicely with 
Construction 17.21 in that it reduces compatibility of simple reductions made 
from a rule (/i, a) to the matter of whether a € DSM(;U,P). On the other 
hand, it is usually monotonicity that is the goal when one constructs the 
relations {P(i)}^g^, so a small lemma bridging the gap may be in order. 

Lemma 6.5. Let I be sorts. Let v: A4{i') — > be a continuous 

homomorphism such that u(r(//)) € Cspan(|f (^u)}) for all fj. € y{i') and 
r £ R*{i'). Let P{i) he a binary relation on y{i) and let P{i') be a binary 
relation on y{i'). If v{v) G DSM(w(/x), P(i)) for all /U, z/ € 3^ such that 
u < fj,m P{i') and v{fi) S y, then v correlates P{i') to P{i). 

Proof. Let /i G y{i') such that v{ii) G 3^ be given. Let D = DSM(w(/x), P(i)) . 
li u £ y{i') satisfies u < ix'm. P{i') then by assumption v{v) G D and hence 
Cspan({f (i/)}) C D, which implies f (r(z^)) G D for all r G R*{i'). Since 
V IS a, continuous homomorphism and D is a topologically closed group, it 
now follows that 

T;(^Cspan({ v G y{i') \u < ^xin P{i') })) C D, 

i.e., V satisfies the condition at /i for correlating P(i') to P{i). □ 

Remark. The meaning of (16. 4p if v ranges over all maps in the family ()3.8p , 
as would be the setting for Bergman's diamond lemma, is that 

< /i in P(l) Az^p < A^p in P(l) for all /i, i^. A, p G X*, (6.5) 

i.e., P(l) must be a (strictly compatible) monoid partial order. The need 
for this very classical condition can thus in the generic theory be found only 
in the resolvability of ambiguities! It would be possible to apply the generic 
diamond lemma with a partial order that violates (16. 5p . but for that one 
would then pay the price that ambiguity resolution gets more complicated. 
Furthermore compatibility of reductions interacts with advanceability so 
that one anyway comes pretty close to needing correlation; in practice the 
choice one has is one of how many advanceable maps one will have rather 
than whether these will correlate the partial orders. 

The condition that advanceable maps should be continuous can similarly 
be regarded as a condition on how the multiplication operation on M{i) = 
'R-{X) should relate to the topology, and if for example (j2.6p holds then 
continuity of maps on the form (13. 8p can also be reduced to a condition on 
multiplication of monomials, but this point of view is not as striking as it is 
for the order on 3^, since continuity to the average mathematician is more 
of an everyday condition. 
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In most classical cases, Lemma 16.51 would be applied to maps v which 
map y{i') into in which case correlation implies strict monotonicity. 

Some algebraic structures will however give rise to "degenerate" maps which 
cannot be strictly monotone on the whole of y{i'), and at least in the notable 
case of path algebras (where the product of two monomials can be zero, see 
Subsection I8.3p the additional precondition that v{^) G 3^(z) provides a 
convenient loophole to avoid getting caught by this technicality. 

The final nontrivial condition in Lemma 16.51 is that v should map ele- 
ments on the form r(/x) into Cspan(|t;(/i)}) . The most common reason this 
condition would be fulfilled is that and ^A{i) are both 7^-modules for 

some ring TZ such that the advanceable map v. — > is 7^-linear, 

while R{i') and R{i) are the sets of actions of elements of TZ on and 
respectively. Note, however, that the general framework makes no 
assumption that maps in R{i') should have counterparts in R{i), or even 
that there should be a corresponding endomorphism on This is a 

reason why advanceability is a condition on how arbitrary elements of R*y 
are treated; for suitably linear maps it is sufficient to check advanceability 
on elements of y. 

Whether it could be useful to have R{i') and R{i) generate nonisomor- 
phic rings of endomorphisms on and A4{i) respectively remains to 

be seen, but not requiring 7^-linearity turns out to be advantageous for 
the related concept of biadvanceability. (Shadow ambiguities are not the 
only ones that are traditionally discarded; equally important are ambigui- 
ties where the parts are disjoint. Biadvanceability provides a way to give 
an abstract definition of this.) Recall that an 7^-bilinear map w satisfies 
w{ra,b) = rw{a,b) = w{a,rb), from which follows rsw{a,b) = rw{a,sb) = 
w{ra, sb) = sw{ra, b) = srw{a, b), for all r,s £ TZ. li TZ is commutative this 
is only natural, but if 7^ is a noncommutative ring then it places a rather 
severe restriction on the range of w: only elements at which all 7^-module ac- 
tions commute are allowed! This would often be insufficient for the intended 
uses of biadvanceable maps. 

A practical compromise that is sometimes available is to request some 
weak form of bilinearity. It might for example be the case that the identity 
w{ra, b) = rw{a, b) = w{a, rb) only holds for monomial a and b. It could also 
be the case that the identity is relaxed to w{ra, b) = r'w{a, b) (and similarly 
for moving out from the second position), where r' £ TZ need not be equal 
to r and may depend on a or b. The condition needed for Lemma 16.51 is 
weaker still — roughly that w{ra,b),w{a,rb) G Cspan({t(;(a, 6)|) for a,b £ 
y; see (j6.6p — and should therefore not be a problem to fulfil when necessary. 

Definition 6.6. Let i, 11,12 G / be sorts. A map w: x A4{i2) — > 

A4{i) is said to be a bihomomorphism if 

w{ai - 61, a2 - 62) = w{ai,a2) - it)(6i, 02) - it)(ai, 62) + w{bi, 62) 
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for all ai,6i G M.{ii) and 02,62 £ Al(i2)- A bihomomorphism w: M.{ii) x 
-^(^2) — ^ is said to be biadvanceable (with respect to T{S){i), 

T{S){ii), and T{S){i2)) if 

• there for every ti £ Ti{S){ii), bi G R*y{ii), and 62 € i?*3^(i2) exists 
some t G T{S){i) such that ^(^(61,62)) = ^1(^1(61), 62) , and 

• there for every t2 G T{S){i2), h G R*y{ii), and 62 G -R*!y(«2) exists 
some t G T{S){i) such that ^(^(61,62)) = "^(51, t2(62)) • 

An ambiguity {t, ji, u) of Ti{S){i) is said to be a montage of the pieces 
{\t') G 3^(ii) X T{S){ii) and (z^, u') G ^^(^2) x T{S){i2) if there exists a 
continuous biadvanceable map w: M.{ii) x M.{i2) — such that 

t{w{\,u)) = w{t'{X),u) 
u(^w{X,i')) =w(A,n'(z^)) 

The map w is called the composition map of this montage. 

Let Vi be a set of maps — ^A{^) and let V2 be a set of maps 

M{i2) — > M{i). A biadvanceable map w: M{ii) x M{i2) — > M{i) is 
said to be (Fi, V2)-biadvanceable if w{-,p) G Vi for all p G !y(i2) and 
w{p, •) G V2 for all p G 3^(ii)- A (Vi, V2)-montage ambiguity is a montage 
ambiguity where the composition map is (Vi, V2) -biadvanceable. 

The idea formalised by the montage ambiguity concept is to recognise the 
situation that the two pieces act on disjoint parts of fi — the pieces are like 
two small windows to completely different gardens that have been embedded 
into a large mural provided by the composition map, and the presence of an 
embedding cannot change the fact that the games that can be played in one 
garden are quite independent of what happens in the other. In Grobner basis 
theory this idea is known as Buchberger's First Criterion for eliminating 
useless critical pairs, although the identification is perhaps not obvious at 
this stage. The correspondence will be made clear in Corollary 18.51 however. 

In the i = ii = i2 = 1 case of M{1) = 7^(X) and 3^(1) = X*, with 
all maps of the form b 1— > z^i6i^2 for some i'i,i'2 £ X* being advanceable, 
the typical form of a biadvanceable map is w{a,b) = viav2bv2, for some 
2^i)i^2,t'3 € X* . Such 'multiplication with fixed extra factors' maps can be 
used to produce a great variety of biadvanceable maps, and the construc- 
tion does not require the multiplication operation to be associative, or even 
binary; pretty much anything that can be composed from multilinear opera- 
tions on and fixed elements of y will probably turn out to be biadvanceable, 
if simple reductions are constructed by putting every rule in every possible 
context. The underlying idea for making a biadvanceable map is however 
to take an element of y and cut out two disjoint pieces from it — the bi- 
advanceable map then consists of inserting the two arguments into these 
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two holes. How such a map may be interpreted depends very much on the 
underlying algebraic structure, but for the diamond lemma machinery it is 
sufficient that the biadvanceable maps exist. 

Lemma 6.7. Let {t,fi,u) be an ambiguity of Ti{S){i) that is a montage 
of the pieces {X,t') G y{ii) x Ti{S){ii) and {u,u') G y{i2) x Ti{S){i2). 
If T{S){j) is compatible with some partial order P{j) on y{j) for all j G 
{i,ii,i2} and the composition map w: M.{ii) x Ai{i2) — > M-ii) of the 
montage satisfies 

u;(^DSM(A,P(ii)),i/) Uu;(^A,DSM(i/,P(i2))) C DSM(u;(A, z^), P(«)) (6.6) 

then {t,fj,,u) is resolvable relative to P{i). 
Proof. The problem is to prove that 

t(/i) - u{fi) = w{t'{X), v) - w{X, u'{u)) = 

= w(t'(A), V - u'{v)) + ^^(^'(A) - A, u'{u)) G DIS(^, P{i), S) , 

and by symmetry it is sufficient to do the first of w[t'{X),iy — u'{v)^ and 
w{t'{X) — X,u'{v)), as the other is completely analogous. 

Let e G 0{i) be arbitrary. Let ei G 0{ii) be such that wisi, v — u'{v)) C 
e. Since t'{X) G DSM(A,P(ii)) there exist {Aj jJLi ^ y{ii) and {r^}^! C 
±i?*(ii) such that Aj < A in P{ii) for all j = 1, . . . , m and YlY=i '''ji^j) ^ 
t'{X) + El. Let Oj = rj{Xj) for j = l,...,m. By biadvanceability there 
exist {uj}^^ C T{S){i) such that Uj[w{aj,u)) = w[aj,u'{i')) for all j = 
1, . . . ,m. Since w{aj, u) G DSM(//, -P(i)) by ()6.6p . these satisfy 



(m K ni 

j=i / j=i 

m 



w 



m 

€^^a-Uj{a) a G DSM(/i,P(i)) I C 
C DIS(//,P(i),5) 

bv Lemma 15. 101 Therefore if (t'( A), — n'(i/)) G DIS(/U, P(i), 5) +e, and by 
the arbitrariness of e thus w (t' (X) , u — u' (v)^ G DIS(/x, P(i), S") , as claimed. 

□ 

The pieces are now in place for a definition of critical as in 'critical pair', 
i.e., "member of a (small) set of ambiguities that together cover all ways in 
which things can fail to resolve". The definition given is with respect to a 
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particular family of advanceable maps, since this is how it will typically be 
applied: when someone considers only those advanceable maps, then these 
arc the ambiguities that need to be explicitly checked. It is often natural to 
let the family of advanceable maps be (the set of morphisms in) a category, 
but there is no technical need for this. 

Definition 6.8. Let a family V = Ui je/ ^(^'•^) maps such that every 

V G V{i,j) is an advanceable continuous homomorphism M.{j) — > M.{i) 
be given. 

The family V is said to be a category if V{i,i) 3 id: M{i) — > M.{i) 
and V2OV1 € V{i^k) for all V2 € vi G V{j,k), and € /. The 

family V = [Jijei ^iiJ) is the category generated by Vi = [Jij&i ^li^j) 
if it is the smallest category that satisfies Vi{i,j) C V{i,j) for all i,j G /. 

An ambiguity {t,iJ,,u) of Ti{S){i) is said to be a V^-shadow of the am- 
biguity {t',iJ,',u') of Ti{S){i') if there exists some v G V{i,i') such that 
IJ, = v{iJ,'), t(iJ,) = v(t'{iJ,')), and u{iJ,) = v{u'{ijl')). If in addition {t',fi',u') is 
not a F-shadow of (t, fi, u) then {t, fi, u) is a proper F-shadow of {t', fi', u'). 
An ambiguity of Ti (S*) is said to be T^-shadow-minimal if it is not a proper 
V-shadow of any ambiguity of Ti{S). An ambiguity of T\{S) is said to be 
y-shadow-critical if it is not a proper F-shadow of any F-shadow-minimal 
ambiguity of Ti {S) . 

An ambiguity {t,ij,,u) of Ti{S){i) is said to be ^-critical if it is V- 
shadow-critical and is not a (^(i, ii), i2))-montage ambiguity for any 
h,i2 e I- 

If y is a category then the ^-shadow relation Qv — formally defined 
by {t,iJ,,u) > {t',iJ,',u') in Qy iff {t,ix,u) is a F-shadow of {t',iJ,',u') — is a 
quasi-order on the set of ambiguities. This point of view is instructive for 
understanding the definition of ^-shadow-critical; (t, fi, u) is a proper V- 
shadow of {t' , /j,', u') iff {t, ji, u) > {t', ji', u') in Qv and {t, /x, u) is F-shadow- 
minimal iff it is Qy-minimal. A first stab at defining F-shadow-critical 
would be to use F-shadow-minimal, reasoning that non-minimal ambiguities 
need not be considered critical as there is always some smaller ambiguity of 
which they are a shadow, but this fails if Qv is not DCC; a simple example 
of a family V for which this might occur is F = {D"-}^^q, where D{1) = 
and D(x"+^) = x'^ for all n G N. By only discarding those ambiguities which 
are proper shadows of a minimal ambiguity, one arrives at a concept which is 
as strong as minimality in the nice cases but is sufficient also in the strange 
cases. 

Theorem 6.9. Let V = [Ji j^jVih j) be a family of maps such that every 

V G V{i,j) is an advanceable continuous homomorphism — ^ Ai{i). 
For each i E I, let P{i) be a partial order on y{i) with which Ti{S){i) is 
com,pa,tible. If every v G V{i,j), for all i,j G /, correlates P{j) to P{i) then 
the following claims are equivalent: 
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(a') Every ambiguity ofTi{S){i) is resolvable relative to P{i), for all i ^ I. 

(a") Every V-critical ambiguity of Ti{S){i) is resolvable relative to P{i), 
for all i ^ I . 

Proof. Since the ^-critical ambiguities of (a") are included among the am- 
biguities of (a'), all that needs to be shown is that the non-y-critical ambi- 
guities are resolvable relative to P whenever the ^/-critical ambiguities are 
so resolvable. Hence assume (a"). 

If an ambiguity {t,fi,u) of Ti{S){i) is not F-shadow-critical, then by 
definition there exists some ^/-shadow- minimal ambiguity, say {t',iJ,',u') of 
Ti{S){i'), of which (t,fi,u) is a proper F-shadow. Since {t',fi',u') is min- 
imal it is not a proper F-shadow of any ambiguity, and hence {t',ij,',u') 
is F-critical. By (a"), {t',iJ,',u') is resolvable relative to P{i'), which by 
Lemma 16.31 implies that {t,fj,,u) is resolvable relative to P{i), as claimed. 

If an ambiguity (i, //, u) of Ti{S){i) is a [V{i, ii), V{i, i2))-niontage ambi- 
guity for some ii,i2 & I then it is resolvable relative to P{i) by Lemma l6.7t 
(j6.6p holds because V{i, ii) is a set of maps correlating P{ii) to P{i), V{i, 12) 
is a set of maps correlating P{i2) to P{i), and the composition map w of 
the ambiguity {t,iJ.,u) is ii), i2))-biadvanceable. □ 

It should be observed that the set of ^/-critical ambiguities is not always 
the smallest set of ambiguities with which one can make do; if (t, /i, u) is 
F-critical then every {t',iJ.',u') such that {t,fi,u) ~ {t',fj,',u') in Qy is V- 
critical as well, even though it is clearly sufficient to check one ambiguity in 
each Qy-squivalence class. In actual calculations this often corresponds to 
being able to pick one labelling of an ambiguity and resolve it in that context, 
instead of having to write a resolution proof for arbitrary labellings. 

Another labour-saving trick which goes beyond the definition of V- 
critical ambiguity is Buchberger's Second Criterion, which in its raw form 
is simply the observation that if three simple reductions ti, t2, and ^3 act 
nontrivially on the same fj,, then relative resolvability of two of the result- 
ing ambiguities {ti,n,t2), is), and (t2>/U,t3) implies the same for the 
third. Under mild extra assumptions on V, this criterion can be given the 
more traditional form that ^2) can be skipped if there exists some 
simple reduction t^ such that ^3) and (t2)M)^3) ^-^e both non-shadow- 
critical, since the latter two can then be assumed relatively resolvable on 
account of being shadows of other ambiguities. This criterion is of practical 
interest because it is often far less work to perform an explicit search for a 
matching ia than it is to explicitly resolve {ti,^,t2), but it is not as theo- 
retically important as the recognition of montage ambiguities (Buchberger's 
first criterion). 

Example 6.10. Let I = {1} and drop sort indices. Let A4 = TZ{X), 
y = X*, R = TZ, the topology be discrete, V = {b i-^ i^ibi^2}^, i/oex*' 



51 



the simple reductions be defined as in Corollary 13.91 (or equivalently Con- 
struction [7]2|) ; this is the setting for Bergman's diamond lemma. Which are 
then the ^/-critical ambiguities? 

Using the notation of (j3.5p . an ambiguity has the form (iAisiJ^D A*) *A2S2i'2) 
where Ai/z^^z^i = /U = X2fJ-s2'^2- By unique factorisation of // in X*, Ai is a 
prefix (left divisor) of A2 or vice versa — hence there exist k € {Ai,A2} and 
A^, A2 G X* such that Ai = kX[ and A2 = nX'2. Similarly vi is a suffix (right 
divisor) of z/2 or vice versa, whence there exist p € {I'l, 1^2} and ^'1,1^2 ^ ^* 
such that ui = u[p and V2 = ^/O. It follows that {tx^swi-, l^^'t\^s2V2) 
a shadow under v{h) = nhp of (iA'^si!/() /^'j ^Ai,s2i^^) where p' = A'^/is-^i^( = 
\'2Ps2^2i ^"^^ t'^i^ shadow is proper unless k = p = 1. Conversely any 
ambiguity {tx^sivx-, IJ--, t\2S2U2) which is a proper F-shadow must have Ai, A2 7^ 
1 or i>i,V2 7^ 1, so it follows that the constructed {ty^g^^i^, p' ,tx'^g,^i,'^) is V- 
shadow-minimal. Hence a ^-shadow-critical ambiguity {tx-^s-^,^-^, p,tx.^s2U2) 
has 1 € {Ai,A2} and 1 G {1^1, 1^2}- Without loss of generality it may be 
assumed that Ai = 1. 

If the ambiguity {tisiui, lJ',tx2S2i^2) such that ps^ is a prefix of A2, say 
•^2 = tJ-siT, then conversely Ps2t^2 is a suffix of vi = Tps2i^2, and consequently 
(tisiui, IJ',tx2S2V2) is a montage with the composition map w{bi,b2) = 61T62 
of the pieces {psi,tis^i) and {ps2'^2,tis2U2)- Hence a ^-critical ambiguity 
{tisiui, fJ',tx2S2V2) rather has p = \2TV1 where either (if ui = 1: an inclusion 
ambiguity) r = /x^j, f = 1^2, and ps^ = A2T1/ or (if vi ^ 1: an overlap 
ambiguity) /x^^ = A2T, ps2 = tv, and v = vi. Either way, the ambiguity is 
uniquely identified by the quintuplet (si, S2, A2, r, i'), which (as it happens) 
is the definition of ambiguity that was used in [2] . This has shown that all V- 
critical ambiguities are among those specified by Bergman and consequently 
Bergman's diamond lemma follows from combining Theorems 15.111 and 16.91 

It may also be observed that if the set S of rules is finite then the set 
of y-critical ambiguities is finite as well; for any given pair (si,S2) G S'^, 
a non-montage ambiguity on the form (tisj^ui, fJ',tx2S2V2) where at least one 
of I'l and U2 is equal to 1 must satisfy degA2 < degps^, and hence the 
number of F-critical ambiguities on this form can be at most deg psj^ ■ This 
gives the overall bound jS"!^ max^gg deg/i<j, or sharper \S\ "^^^g deg ps, for 
the number of ^-critical ambiguities, but the actual number is often much 
lower. 

Note how the finiteness of the set of ^-critical ambiguities whenever 5 
is finite requires that the montage ambiguities are discarded. This is not 
the case in the commutative counterpart A4 = TZlX], since in that case 
all montage ambiguities are shadows of the one with w{a,b) = ah. This is 
probably the reason that this important principle in the commutative theory 
is merely known as the "first criterion" . 

As was parenthetically remarked in the example, Bergman distinguishes 
between inclusion and overlap ambiguities, where the former have the prop- 
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erty that fis2 divides (is a subword of) fisi ■ Since the property of being a 
divisor can be expressed in terms of advanceable maps, these classes may 
be defined also in the more abstract setting. 

Definition 6.11. An ambiguity {t,fi,u) of Ti(5)(i) is said to be an inclu- 
sion ambiguity, where t is called the inner reduction and u is called the 
outer reduction, if there for every advanceable map v: A4{i') — > A4{i) 
and ifi',u') G y{i') x Ti{S){i') such that n = v{ii') and u{n) = v{u'{ii')) 
exists some t' G Ti{S){i') such that t{^) = v{t'{^')^. The inclusion is said 
to be proper if only one of the reductions fit the definition for being the 
inner reduction. 

An ambiguity is said to be an overlap ambiguity if it is neither a 
montage ambiguity nor an inclusion ambiguity. 

For handmade sets of simple reductions Ti{S) (or ditto sets of rules S 
from which they are made), inclusion ambiguities are rare, because they typ- 
ically mean the outer reduction is redundant and can be dropped without 
changing Irr(S') (by Theorem 15. 6p . Red(S'), or (by Lemma 14. lip . The 
situation is a bit different in sets of reductions that are automatically gen- 
erated by some completion procedure, since it is very common that special 
cases of a rule are derived before (and even used in deriving) the more gen- 
eral rule that one may find in the literature. Relying on Lemma 14.111 for 
simplifying the set of reductions would require keeping all reductions until a 
complete set is found and only then drop those which are redundant, but it 
is usually more practical to drop them as soon as the inclusion is discovered. 
The next theorem gives conditions for this. 

Theorem 6.12. For every i G I, let Ti{S'){i) C Ti{S){i) and a partial 
order P{i) on y(i) with which Ti(5)(i) is compatible be given. Assume 
there is some io & I and to G Ti{S){io) \ Ti{S'){io) such that there for every 
i & I, t & Ti{S){i) \ Ti{S'){i), and fj. G y{i) on which t acts nontrivially 
exists a continuous homomorphism v. M.{io) — > M.{i) which is advanceable 
with respect to Ti{S'){io) and Ti{S'){i) and also some fiQ G y{io) such that 
viiio) = n, tin) = v{to{^xo)), and ^;(DSM(/xo, P(io))) C DSM(/x, P(i)). 

If for all i £ I all ambiguities of Ti[S'){i) are resolvable with respect 
to P{i) and there for every /xq G 3^(io) on which to acts nontrivially exists 

some uq G Ti(5")(io) such that to(/"o) — ^^o(/^o) £ DIS(/io, -P(«o)) 5*')' ^hen 
for all i £ I all ambiguities of Ti{S){i) are resolvable with respect to P{i) 
andl{S'){i) =l{S){i). 

Proof. Let i G / be arbitrary. Let (t, /x, u) be an arbitrary ambiguity 
of Ti{S){i). li t,u G Ti{S'){i) then t{^i) - u{n) G DIS(^, P(i), 5') C 
DIS(//, P(i), 5) by assumption. 

If t G Ti[S){i) \ Ti{S'){i) and u G Ti{S'){i) then by assumption there 
exists some /iq G y{io) and an advanceable continuous homomorphism 
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v: M{io) — > M{i) such that v(DSM(/io, P(io))) ^ DSM(/i, P(i)), = 
v{fio), and t{fi) = v(^to{fio)). Since v(to(Aio)) = ti^J-) / A* = ^^(/"o) it fohows 
that to acts nontriviahy on ^uq, and hence there exists some uq € Ti(5")(io) 
such that tQ{no) = mo(/"o) (mod S' < in -P(^o))- By the second claim of 
Lemma [0| f (to(/^o)) = ^'(^o(Aio)) (mod S' < fi in P{i)), and by advance- 
abihty there exists some ui G T{S'){i) such that ui{^) = v[uq{^q)); in other 
words t{^) = tti(//) (mod 5' < in P{i)). 

It need not be the case that ui = u, or even that ui G Ti{S'){i), but 
typically tii(/i) 7^ /x and then there exist uia € Ti(5')(i) and «ib G T{S'){i) 
such that ni(/i) = ?ii{,(nia(;u)) and uia{^) 7^ /U. In this case, {u,fi,uia) is 
an ambiguity of Ti{S'){i) and u(^) = uia{n) (mod 5' < /i in -P(i)) since 
it is resolvable. Furthermore uia{pL) = ui{fi) (mod 5' < /i in -P(i)) by 
Lemma 15.101 and it follows that {t,fj,,u) is resolvable relative to P{i). In 
the degenerate case that ui{fj,) = fi, one has the curious situation that 
/i = v{uo{fio)) G u(DSM(/xo,P(io))) ^ DSM(//,P(i)); there must be an 
alternative expression for fi as a linear combination of strictly smaller ele- 
ments of y{i). Hence ii~u{iJ,) G DIS(/i, P{i), 5") by definition and therefore 
t{fi) = ui{fi) = fJ- = u{fj,) (mod S' < fj, in P{i)). 

The case t G Ti(5')(i) and u G Ti(5)(i) \ Ti{S'){i) is handled similarly. 
The case t,u G Ti{S){i)\Ti {S'){i) is handled by combining the two previous 
cases — having two advanceable maps v,v': Ai{io) — > and ^0)/^o ^ 

y{io) such that w(/io) = = ^''(/"o), *(/") = f (to(Mo)) , and u(/x) = v'(to(/^o))- 

As for the claim that Z{S'){i) = 2{S){i), it follows from Lemma (3. 71 that 

I{S'){i) = Cspan({ fi - t{fi) I /X G 3^(i), t G ri(5')(i) }) C 

C Cspan({ /X - t(/i) I /X G K^), * ^ ri(5)(i) }) = T(5)(i). 

Furthermore, if t G Ti{S){i) \ Ti{S'){i) and /i G y{i) can give a nonzero con- 
tribution to the second Cspan, i.e., if they are such that /x — t{fi) ^ 0, then 
by assumption there exist /xq G y{io) and an advanceable continuous homo- 
morphism v: A4(^o) — ^ A^(i) such that = v{ij,q), t(/x) = t'(to(A*o))) and 
v{'DSM{no,P{io))) Q BSM{n, P{i)). There also exists some uq G ri(5')(«o) 
such that to{fio) = uq{^q) (mod S' < /xq in P{io)) and some n G T(S")(i) 
such that u{v{fio)) = 'L'(iio(/xo)) • Hence 

/i - t(/x) = w(/xo - to(/"o)) = ^'(/"o - ^io(/"o)) + v{uo{no) - toino)) e 

G {^l-u{^i))+v(BlS{^,o,P{io),s')) c 

C T(5') (i) + DIS (/X, P(i) , 5') = I{S') (i) 
and thus I{S) (i) C T(5') (0 . □ 
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7 A framework construction 



In the last couple of sections, the generic theory has been developed to a 
point where it is comparable to the ring theory diamond lemma provided 
that one can set up the necessary framework. The ideas behind the stan- 
dard construction have already been presented, but it is convenient to collect 
everything in a formal statement to facilitate citations in other papers. Fur- 
thermore the last couple of sections have demonstrated that one typically 
wants a bit more than just the basic framework assumptions, so in sup- 
port of the framework construction there are also some lemmas which give 
more elementary conditions that suffice for establishing the advanceability, 
compatibility, and equicontinuity properties. 

The first lemma concerns the basic construction of a topology. It serves 
mainly as a preparation for Lemma 17.61 and may certainly be skipped if 
one is only interested in a discrete topology. For simplicity, it is stated in 
single-sorted notation. 

Lemma 7.1. Let IZ he an associative unital ring with ring ultranorm |-| 
which is complete in the topology induced by this norm. Let some nonempty 
set Y and a function U : Y — > be given. Let A4 be the free TZ-module 
with basis Y. For every fi e Y , let f^: M. — > TZ be the coefficient- of- (i 
homomorphism, i.e., f^{fj) = 1 for all fi GY and /^(z^) = for all fi,]^ eY 
such that fi y. Define 

||a|| := max|/„(a)| ?7(/x) (7.1) 

for all a M. Then \\-\\ is an TZ-module ultranorm on M. Let 

R= {a>-^ r ■ a : M — > M\r e TZ,\r\ l} , 

Bn= {ae M\ \\a\\ Kl^-"-} forn^l. 

Then O = {-Bn}5?Li satisfies Assumption and every f^ for fi ^ Y is 
continuous. The extensions o/ {/^}^gy and \\-\\ to M. by continuity satisfy 
fTT]) and 

||/^(a) -/xll ^ ||a|| for all fie Y, (7.2) 

for all a e M. 

Proof. That the right hand side of ()7.ip exists for all 6 € follows from 
the fact that { € ^ | /^(6) 7^ } is finite for every h e M.. The claim 
that ll'll is an 7?.-module ultranorm is shown by verifying the conditions in 
Definition 12.91 ||a|| is nonnegative because it is a maximum of nonnegative 
numbers. Since |-| satisfies the strong triangle inequality. 



6|| = max |/^(a -h)\U{fi) ^ maxmaxi |/^(a)| , 1^(6)1 \ u{fi) = 
fieY /igY L ' 'J 

= max< max |//t(a)| U{fi), max \ fu.{b)\ U{fi) > = max{||a|| , ||6j|} 
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for all a,b e M. \\a\\ = iff = for all fi e y, which holds iff 

ffj,{a) = for all fJ, £ y, which in turn is true iff a = 0. For every r € 7^ and 
a e M, 

\\r ■ a\\ = max \ fu{r ■ a)\ U{fi) = max |r • fa{a)\ U{ii) ^ 

^ max|r| |/u(a)| U{^) = \r\ \\a\\ . (7.3) 

IJ.GY 

Hence ||-|| is an 7^-module ultranorm. As such, it is also a uniformly con- 
tinuous function Ai — > [0, oo[ C M since the absolute value of ||a|| — ||6|| by 
the triangle inequality is bounded from above by \\a — b\\. By the complete- 
ness of the codomain, it follows that ||-|| extends by continuity to a function 
Ai — > [0, oo[, and this extended map will also be an 7^-module ultranorm 
because left and right hand sides in the axioms for this are all continuous 
and thus the axioms are preserved under taking limits. 

It follows from ()7.3p that any set of all a € such that ||a|| < e for some 
e G M"*" is an i?-module. That fX^=i^n = {0} is because ||o|| = implies 
a = 0. Let e G and G y be given. Since |//i(a)| U{^) ^ ||a|| for all 
a £ M, it follows that any a £ A4 such that ||a|| < eU{ij) has |/^(a)| < e, 
and hence is continuous. By this continuity and the completeness of TZ, 
it extends to a map M — > IZ. Furthermore 

■ ^11 = fu{f^i{a) ■ fj) U{u) = max /^(a) • fu{^l) U{v) = 



ueY 



max 



[0,\f^{a)\U{fi)] = \U{a)\U{fi) ^ \\a\ 



which demonstrates (17. 2p for a € A^. By continuity of left and right hand 
sides it continues to hold for arbitrary a € A4. 

On the matter of (17. ip for a € A4\A4, one may first observe that ||a|| > 
and thus there exists some b £ M such that ||a — 6|| < ||a||. It follows 
from the strong triangle inequality that ||a|| = ||6|| = max^gy 
let £ Y he an element in which this maximum is attained. Since 
|//i(c)| U{fi) ^ ||c|| for all c E and fi £ Y, this inequality by continu- 
ity holds for all c E A4. Hence |/^(a — 6)| U{fi) ^ \\a — b\\ < \\a\\ and thus 

|^(a)| U{fi) = 1/^(6) + /^(a - 6)1 U{fi) ^ 

<max{|^(6)|;7(^),|^(a-6)|C/(^)} ^ ||a|| . 

On the other hand, 

l|a|| = I/moWI ^(^o) = - /Mo(a " ^)| ^(^o) ^ 



^ max 



{|/Mo(«)|^(^o),|/«(a-6)|[/(/io)} 
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and since \fii(}{a — b)\U{fi()) < \\a\\ it follows that |/^g(a)| C/(/io) = 
This has verified not only that the maximum in the right hand side of (jT.ip 
exists, but also that it equals the left hand side. □ 

The main specialisation made in this construction is that every A4{i) is a 
free 7?.-module, for some fixed ring TZ. The topological conditions may seem 
extensive, but they are all fulfilled in the case considered in Lemma 17. H 
and they are of course void in the case of a discrete topology. The classical 
case is furthermore that R{i) = TZ{i) and y{i) = Y{i), but as discussed in 
Section [21 things aren't always that simple. 

Construction 7.2. Let TZ he a unital associative topologically complete 
ring. For every i € I, let Y{i) be an arbitrary set and let M.{i) be the 
free 7^-module with basis Y{i). For every fi £ Yii), denote by the 
coefficient-of-;U homomorphism — > TZ. Also let TZ{i) be the ring of 
7^-actions on M{i), let R{i) be a subring of TZ{i), and let R-^{i) Q TZ{i) 
be a set such that X]re-R-L(j) -^(0 o r = TZ{i) and id G R^{i). Let y{i) = 
{ r{n) \ r e R^{i),n G Y(i) } ; this ensures Assumption [2] is fulfilled. For 
every i G I, let 0{i) = {Bn{i)} be a family of i?(i)-modules satisfying 
Assumption [3] and in addition being such that the 7^- module multiplication 
TZ X M.{i) — > : {r,h) ^ r ■ h and all maps are continuous. 

Let V = Uijg/^(^>i) such that every v € V^(i,j) is a continuous 
7?,-linear map M.{j) — > J^{i). Let S = Ujg/ '^(^) arbitrary such that 
S{i) C Y{i) X M{i). For all i,j G /, let 

W{i, j) = { {v, ifi, a)) G Vii, j) X S{j) I v{fi) G Y{i) } 

and define Ti{S){i) = \Jjei{^-",^}(.v,s)eW(i,j), where 

tv,M{b) = b - f^(^f,){b) ■ v{fi - a) for all 6 G (7.4) 

This Ti{S){i) satisfies Assumption H] and every r G TZ{i) is absolutely ad- 
vanceable with respect to Ti(S){i), for all i (z I. Furthermore every ambi- 
guity (fi, r(//), of Ti{S){i) where /i G Y{i) and r G R'^i'i) \ {id} is an 
absolute shadow of the ambiguity (ti,/i,t2)- 

Proof. Let i G / be given. That any t G Ti{S){i) is continuous and 7^-linear 
follows from ()7.4p since this formula is a composition of maps with these 
properties, and thus t is a continuous homomorphism satisfying tor = r ot 
for all r G TZ{i). Not only does this satisfy Assumption HI but it also means 
every r G TZ{i) is absolutely advanceable. □ 

Lemma 7.3. Let everything he as in the construction. Assume in addition 
that V is closed under composition and satisfies v{Y{j)^ C Y{i) U {0} for 
all V G V{i,j) and i,j £ I. If S is such that v{^) = implies v{a) = for 
V G V{i,j), (/U, a) G S{j), and i,j G / then for all i,j G / every element of 
V{i,j) is advanceable with respect to Ti(S){j) and Ti{S){i). 
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Proof. Let i,j e I, w e V{i,j) such that w{Y{j)) C Y{i) U {0}, t' G 
Ti{S){j), and b G R*y{j) be given. By definition of Ti{S){j) there exists 
some k £ I and (f, (/i,a)) G A;) such that t' = t^,{^,a)- Furthermore 
there exist v G and r £ TZ such that 6 = r • z^. There are three cases 
for v{t'{b)). 

1. If v(fi) 7^ 1/ then f^(^f^){b) = and hence = 6, in which case 
t{w{b)) = w{t'{b)) for t = id. 

2. If = and w{u) G then (to o tj, a)) G fc) and hence 
one can consider t = itooD,(/i,a)) which is the most interesting case. 
fv{fi){t') = ^ aiid hence = r •'L'(a), so that w{t'{b)^ = r-w{y{a)) = 
r ■ t[{w o = t[r ■ w{i')) = , as claimed. 

3. liv{fi) = V and w(v) = then {wov){^) = and hence {wov){a) = 0, 
which means w[t'{b)^ = w{r ■ f (a)) = r ■ {w o v){a) = r ■ {w o v){fi) = 
w{r ■ v{fi)) = w{b), and thus t(^w{b)) = w[t'(b)^ for t = id. 

Either way, there exists some t G T{S){i) such that = □ 

Lemma 7.4. Lei everything be as in the construction. Let a partial or- 
der P{i) on y{i) be given for every i £ I. Assume every v £ V{i,j) 
correlates P{j) to P{i) and every r £ R^{i) correlates P{i) to itself. If 
a £ DSM(;U,P(j)) for all {fJ,,a) £ S{j) and j £ I, then T{S){i) is compati- 
ble with P{i) for all i £ L 

Proof. Let i,j £ I, (v, (/i, a)) G W{i,j), and u £ y{i) be given. It must be 
shown that G {u} UBSM [u, P{i)) . If/^(^)(z^) = then = 

v and all is well. Otherwise u = r ■ v{fi) for r = fy(p){i') and t^,(^,a)(i^) = 
r ■ v{a). Since a £ DSM(/i,P(j)) it follows that v{a) £ i;(DSM(/i', P(j))) C 
BSM{v in), P{i)) and hence r • v{a) £ r ■ DSM{v {^), P{i)) C I)SM{u, P{i)) . 
Therefore all t £ Ti{S){i) are compatible with P{i). By Lemma 15.41 this 
extends to the whole of T{S){i). □ 

Lemma 7.5. Let everything he as in the construction. Let i £ I be given 
and let J\f ^ 0{i) he a family of topologically open subgroups of M.{i) such 
that fv{a) ■ u £ e for all a £ e, e £ J\f , and v £ Y{i). If all j £ I, 
(f, in, a)) £ W{i,j), r £ TZ, and e £ Af such that r ■ v{ii) £ e also satisfy 
r ■ v{a) £ e then: 

1. t{e) C e for any e £N and t £ T{S){i). 

2. T{S){i) is equicontinuous. 

Proof. The second claim is an immediate consequence of the first {6 = e 
works for all reductions). If the first claim holds for two particular re- 
ductions, then it also holds for their composition; hence it is sufficient to 
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verify it for simple reductions. Let t £ Ti{S){i) be given. By definition 
there is some j £ I and (v, (^,a)) G W{i,j) such that t = tt,,(^,a)) i-e-; 
t{b) = b — fv[^){h) ■ v{fi — a) for all b G M.{i). Let e £ N and 6 G e be arbi- 
trary. • f (y^) G e by the condition on M and hence fv(p){b) ■ v{a) G e 
by assumption. It follows that t{b) = b — fv{ii)ib) ■ v{fi) + fv{^){b) ■ v{a) G 

e — £ + £ = £. □ 

In [71 Lemma 3.25], a different proof of equicontinuity can be found 
which is feasible also in cases where the first conclusion of the above lemma 
does not hold; the idea is to consider T{S){i) that are compatible with some 
P{i) and require the latter to satisfy a 'squeeze property' (as in the Squeeze 
Theorem of elementary analysis): for every e G 0{i) there must exist some 
8 G 0{i) such that if /i G y{i) H e then every v < ^ m. P(i) must satisfy 
u £ 6. However, I currently don't have any example of a situation where this 
additional generality is needed. That proof is also easily disturbed by the 
existence of "small" scalars, since it might happen that the r(/x) G £r]R*y{i) 
some t G T{S){i) acts upon does not satisfy /x G e; it is typically necessary to 
have some condition ensuring that elements of R{i) act somewhat uniformly 
on and even then things can get hairy. 

The final lemma is instead a special case of Lemma 17.51 which separates 
the norm conditions on S, V, and TZ. 

Lemma 7.6. Let everything be as in the construction. Let a function 
Ui : Y(i) — > be given for every i £ L. Assume the topology in TZ is 
given by a ring ultranorm |-|. Also assume for all i £ L that 0{i) is con- 
structed from |-| and U{i) as in Lemma \7.1\ and let ||-||- be the IZ-module 
ultranorm on Ai{i). Assume that there for every v £ V exists a constant 
Cv G such that if v £ V{i,j) and /i G Y{j) then ||f (^)|| . ^ Ci,Uj{fi) and 
if in addition v{fj,) G Y{i) then Ui{y{iij) = CvUi{fi). 

VW'^Wj ^ Uj{fi) for all [fx, a) £ S{j) andj £L, then \\t{b)\\. ^ \\b\\- for all 
t £ T{S){i), b £ ^A{^), and i £ L, and moreover T{S){i) is equicontinuous 
for every i £ L. 

Proof. This is a special case of Lemma 17.51 In order for C e for any 
e G 7^ and t G T{S){i) to imply ^ \\b\\- for all t £ r(S')(i) and 

b £ it is necessary to take 

Af=\^{b£M{i)\\\b\\^<e} e G M+ } , 

but apart from that the proof is purely a matter of demonstrating that the 
conditions in Lemma 17.51 are met. That fuia) ■ i' £ £ for all a G e, e G A/", 
and V £Y{i) follows from ()7.2p . 

For the main condition, let i,j G /, (v, (/x, a)) G j), r £ TZ, and e G 
M"*" such that ||r • < e be given. By (I7.ip . ||r • "(^(z^)]!^ = |?"| = 
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CvUj{^). For every v € Y{j), 

r ■ v{fy{a) \r\ \\U{a) ■ v{u)\\. ^ |r| \U{a)\ \\v{u)\\. ^ 

^ \r\ \ fu{a)\ C.oUj{u) ^ \r\ \\a\\ - ^ |r| C^Ujifi) = \\r ■ v{fi)\\. < e. 

Since ||-||^ is an ultranorm and r • v{a) is in the topologically closed group 
generated by {r ■v[f^{a) ■ i')}j^^Y{j)^ ^'-'^ follows that ||r • < e. □ 

8 Grobner bases 

The following treatment of Grobner bases is primarily aimed at demonstrat- 
ing how some known results in this area can be derived from the diamond 
lemma, hence it does not seek to give a definition of Grobner basis that 
applies in all situations covered by the [M, R,y,0,Ti{S)) formalism. The 
restrictions that will be made are: 

1. There will only be one sort. 

2. The topology will be discrete. 

3. A4 will be a free left 7?.-module, where 7^ is a unital ring, and R will 
be the set of maps that multiply by an element of 7?.. 

4. y wiU be a basis of M. 

One restriction that will not be made is that of only considering total orders, 
as that is needed more to ensure existence of Grobner bases than to define 
or use them. Some algebraic structures require compatible partial orders to 
be non-total, so a restriction to total orders really sacrifices some generality. 



8.1 Generic theory 

On a practical level, the property that something is a Grobner basis is equiv- 
alent to the four claims in Theorem 15.111 which means several equivalent 
characterisations of this concept could be made. The standard definition 
is however the fifth claim that ^the leading monomial of an element of the 
ideal must be a multiple of the leading monomial of some element of the 
basis\ which accordingly appears as claim (d) of Theorem 18.41 One reason 
this characterisation has become so popular is no doubt that it is amenable 
to an informal presentation — everybody knows what the leading monomial 
is, don't they? — although once one starts to do anything with the concept 
(such as reducing modulo a tentative Grobner basis), most technical de- 
tails of a reduction-based approach quickly suggest themselves. Moreover, 
even the issue of what it means to be the leading monomial is not without 
technical complications when considered in the present generality. 
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Definition 8.1. Let {/^: M — ^T^}^^y be the family of coefficient-of-/i 
homomorphisms associated witti tiie basis y for A4. The support supp(a) 
of an a G Al is the set of /i G 3^ for which f^{a) ^ 0. 

Let P be a binary relation on 3^. A P-leading monomial of some 
a € is a P-maximal element of supp(a), i.e., a G supp(a) such that no 
v G supp(a) satisfies fj, < u in P. Denote by LMp(a) the set of P- leading 
monomials of a. If LMp(a) has exactly one element, then denote that by 
Imp(a). 

The main reason for restricting this treatment to 3^ being a basis of A4 
and the topology being discrete is that this ensures supp(a) — informally 
"the set of monomials occurring in a" — is well-defined. Linear dependencies 
in y would obviously remove the foundation for this concept, and topology 
can (but doesn't have to) produce similar problems: on can choose a topol- 
ogy such that there are Yi,Y2 C y for which Span(yi) n Span(y2) = {0} 
but Cspan(Yi) nCspan(y2) 7^ {0}. Even after ensuring that supp(a) is well- 
defined for all a G tM, a topology can cause the definition of LMp(a) to fail, 
if some supp(a) is infinite and contains an infinite P-ascending chain. One 
approach for defining Grobner bases without relying on the support concept 
could be to replace the concrete construction ofhM.p(g) by an abstract map 
L that assigns a set of leading monomials to each element of Ai . The effect 
would probably be similar to the formalism in [11], even though that techni- 
cally goes in the other direction: the "L" map has a canonical construction 
but the monomials are abstracted away. 

Definition 8.2. Let F be a set of 7^-module homomorphisms Ai — > A4. 
Let P be a binary relation on y. Let A'^ C be a V-ideal. A subset G of 
is said to be a P-monic y-Grobner-basis of A^ if it is P-monic, v{g) G A^ 
for all V V and g G and there for every a G N and ^ G LMp(a) exists 
some g € G and v gV such that fi = v(lmp{g)) . 

Making P-monicity a precondition for Grobner bases serves two pur- 
poses: it ensures there is a unique P-leading monomial and it ensures reduc- 
tions compatible with P can be manufactured from basis elements. While 
these are important ingredients in Lemma 18.31 the P-monicity condition 
also works against a very elementary result in traditional Grobner basis 
theory, namely that every ideal should have a Grobner basis. Without the 
P-monicity it would be possible to simply make the observation that the 
ideal itself is a Grobner basis for it — admittedly a ridiculously large basis 
(probably infinite in most cases where finite bases exist), but nonetheless 
a basis — which formally justifies assuming every ideal one needs to work 
with is generated by a Grobner basis. If a Grobner basis is to be P-monic 
however, one has to be careful about what elements can be included, but 
as long as P is a total order and 7^ is a field there is always a P-monic 
counterpart of every nonzero element of A4 . 
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It should also be observed that the definition of Grobner basis does not 
explicitly require G to be a F-ideal basis for N, and in fact it depends on P 
whether this follows. A trivial counterexample is to consider N = A4 = TZ[x] 
and G = {1 + x} where 7^ is a field, 3^ = {x"}„gN, V = {b t-^ x"6}„gN, and 
X™ ^ x" in P iff m ^ n; since lmp(l + x) = 1 it is easy to see that G is 
a l/-Grobner-basis for N, but 1 ^ Span({(l ^^'^ so G isn't a 

y-ideal basis for N. The catch in this example is that P isn't well-founded; 
the condition defining Grobner bases lends itself to the step in an induction 
for proving the ideal basis property, but it cannot also provide the base for 
that induction. The next lemma gives sufficient conditions on V and P for 
Grobner bases to be ideal bases. 

Lemma 8.3. Let V be a monoid ofTZ-module homomorphisms A4 — > Ai. 
Let P be a well-founded partial order on y which is correlated to itself by 
every v & V. If G Q M is a P-monic V -Grobner-basis for a V -ideal 
N A4 then G is a V -ideal basis for N. If furthermore Ti{S) is as in 
Construction \77^ for S = {(lmp{g),lmp{g) — 9)}g^Q then T{S) = N and 
for every a ^ N there exists some t E T{S) such that t{a) = 0. 

Proof. The main claim is that about existence of reductions which map 
elements of to 0. What the Grobner property implies is that there for 
every nonzero a (z N and fi € LMp(a) exists some t € Ti{S) which acts 
nontrivially on fi, namely t = t^^^_y^g^ where g G and v V are such 
that /i = v(lm.p{g)^ , since this is where = lmp{g) and b = v — g. All 

these simple reductions are compatible with P, since b E DSM(z^, P) because 
g is P-monic and v{b) € DSM(/i, P) = DSM(t;(z^), P) by assumption. 

Let ao € A^ be given. Construct from any nonzero an (z N the element 
a„+i = Un{an) by picking as u„ some composition Un,mn, ° - ■ ■ °Un,i of simple 
reductions such that Un,k G Ti{S) acts nontrivially on fin,k ^ LMp(a„), 
where {^n.i, • • • > Ain,m„} = LMp(on). The claim follows once it has been 
shown that a„ = for some n (which means ui = id for all I > n), and the 
way to establish this is to consider the sets LMp(a„). 

Let Z = UJ^Lo LMp(a„). First observe that Un{fJ'n,k) G DSM(/i„^fc,P) 
for any n and k by Lemma [5.41 Since any P-leading monomial of an is some 
lin,k, and since Span(supp(an) \LMp(an)) C X^^i DSM(^„^fc, P), it follows 
that 

LMp(a„+i) C supp(a„+i) C | e 3^ | i/ < in P for some € LMp(a„) } 

for n = 0, 1, . . .. Construct the directed acyclic graph D which has Z as 
vertex set and has an edge from /i to z/ iff /i > z/ in P and there exists some 
n € N such that ^ € LMp(a„) and u G LMp(a„+i). Since any path in this 
graph is a P-descending chain, it is finite. Since any LMp(a„) is finite, the 
graph has finite branching. Finally the only roots in D are the elements of 
LMp(oo). Hence Konig's lemma (an infinite tree with finite branching has 
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an infinite path) applies, and it follows that Z is finite. In particular, there 
exists some n for which LMp(a„) = and thus a„ = 0, as claimed. 

That C X(S') is now immediate from the definition of the latter. 
Conversely it may be observed that if — 7^ for some t G Ti{S) and 
H & y then there exist g G G and v € V such that — = v{g), since t 
is of the form • Thus 

I{S) = Cspan(^{i;(/x-a) G V,{fi,a) G S,v{fi) G 3^ }) C 

C Cspan(^{t;(c,) \ v eV,g eG}'^ CN 

and this also shows that G is a F-ideal basis of A^. □ 

With this result about the existence of reductions which map ideal ele- 
ments to 0, it becomes easy to link the Grobner basis concept to those of 
Theorem 15. Ill Claim (a'") below is included because it is literally the claim 
that "all S-polynomials reduce to 0" which is practically used to verify that 
something is a Grobner basis. 

Theorem 8.4. Let P be a well-founded partial order on y. Let V be a 
monoid of IZ-module homomorphisms M — > A4 that map y into y and 
are strictly monotone with respect to P. 

Let S ^yxM be such that a G DSM(/i,P) for any {n,a) G S. LetTi{S) 
he as in Construction \7.2[ Then the following conditions are equivalent: 

(a) Every ambiguity ofTi{S) is resolvable. 

(a') Every ambiguity ofTi{S) is resolvable relative to P. 

(a") Every V -critical ambiguity ofTi{S) is resolvable relative to P. 

(a'") For every V -critical ambiguity (ti,fj,,t2) of Ti{S) there exists some 
t G T{S) such that - t2{li)) = 0. 

(b) Every element of Ai is persistently and uniquely reducible. 

(c) Every element of A4 has a unique normal form. 

(d) The set {// — a}(/^,a)eS' ^-^ P-monic V -Grobner-basis ofZ{S). 

Proof. First observe that strict monotonicity of V implies correlation by 
Lemma 16.51 and hence T{S) is compatible with P by Lemma 17.41 All ele- 
ments of V are advanceable with respect to Ti{S) by Lemma 17.31 ^-nd thus 
T{S) is a F-ideal by Lemma ISTl 

Claims (a), (a'), (b), and (c) are equivalent by Theorem 15.111 Claims 
(a') and (a") are equivalent by Theorem 16.91 (a'") implies that every V- 
critical ambiguity is resolvable, and hence (a") by Lemma I5.1UI Conversely 
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(b) implies that /i G Red(5') for every F-critical ambiguity (ti, ^, t2) of ^1(5*) 
and hence = t^[t2{fi)), from which fohows {ti{^) - t2(/u)) = 

and thus (a'"). 

Assume (d). By Lemmas 15.51 and 14.21 Ai = Irr(5) +X(5). Furthermore 
every a G 2^(5) nlrr(S') satisfies t(a) = a for ah t G T{S), but by Lemma [8^ 
there is some t £ T{S) such that t(a) = 0. Hence a = 0, which has estab- 
hshed Ai = Irr(S') ©Z(S'). It follows that claim (d) implies claim (c). 

Finally assume Red(S') = M. and let b G ^{S) be arbitrary. Since 
t^{b) = there exists some t G T{S) such that t{b) = 0. Let A G LMp(6) be 
arbitrary. Since A ^ LMp(O), there exists some decomposition t = t3ot2oti 
where t3,ti G T{S) and ^2 G Ti{S) are such that A G LMp{ti{b)) but A ^ 
LMp((i2 o ti){b)). Since there is no G supp(ti(6)) such that u > Xin P, 
it must be the case that t2 acts nontrivially on A. Due to the way Ti(S) 
was constructed, this means there is some v £ V and (^u, a) G S" such that 
v{iJ,) = A. Since furthermore /j, = Imp {/i — a), the condition with respect 
to b and A for {fi — a}(p^a)es to be a l/-Grobner-basis is fulfilled. Hence 
claim (b) implies claim (d). □ 

A classical case not handled by this theorem is that P is a total or- 
der but elements of TZ sometimes aren't invertible. This is where monicity 
becomes a restriction, since there in for example the case that TZ is an eu- 
clidian domain exists an established theory — which in addition to gaussian 
elimination and polynomial division also generalises integer division (with 
remainder) — for Grobner bases where leading terms may have noninvertible 
coefficients. Reducing a term a^xu by a basis element g whose leading term 
is b/j, then consists of subtracting qgi' from a^u, where q is the quotient 
of a by b, and may thus fail to completely eliminate the //i^ term. What 
makes this incompatible with the diamond lemma framework is however 
that the quotient q, and hence the reduction as a whole, is not given by a 
homomorphism; (ai div6) + (02 div6) is not always equal to (oi + 02) div6, 
even through the error may be predictable. The standard bases formalism 
of [mill] has facilitie£] that can handle this, and it's quite possible that the 
same trick could be applied also in a modification of the diamond lemma 
formalism, but for the moment I don't see a pressing need for this. It is 
more interesting to examine some alternative approaches for coping with 
noninvertible coefficients within the present framework, even though they 
are perhaps not as general. 

^ In particular the duplication of addition operations: one which is "with carry" (com- 
ing from the filtered structure) and one which is "without carry" (coming from the as- 
sociated graded structure). For a suitable choice of filtered structure the latter addition 
has (ai div&) + (02 divfe) — {ai + a2) divfe, and since reductions are required to be homo- 
morphisms with respect to this "without carry" addition, it is then allowed to use integer 
division when constructing reductions. 
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If TZ can be regarded as an algebra over some smaller subring (maybe 
even sub field) /C, then a practical solution can be to change the boundary 
between R and y, letting the former encode just /C and extending y accord- 
ingly. Corollarv 18.61 below can be viewed as using this approach to Grobner 
bases in TZ{X) where TZ itself is a commutative polynomial algebra /C[Xi]. 

The other extreme is that 7^ = Z, in which case there is no additional 
freedom that can be gained from shrinking R since the group structure 
alone determines what multiples of monomials are mapped to. Consider the 
case that one wishes to make a model for Z[x]/{2x). It is easy to jump 
to the conclusion that the diamond lemma framework cannot handle this, 
on the grounds that TVi = Z[x] is a free Z- module and hence any Irr(S') 
must be free too whereas the target A4/Z{S) = Z[2;]/(2x) clearly is not. 
It is true that Theorem 18.41 is so restricted, but there is nothing in the 
generic theory which requires one to pick Z[x] as A4 (even though that 
would be the trivial choice). An interesting alternative in this case would 
be = Z[2;] © Z2[x2], since one for Ti{S) = {txri^x^}'^=i U {txOy^o} indeed 
gets Z[x]/(2x) ^ M/IiS) ^ Irr(S') as Z-modules. 

This approach of introducing "modular coefficients" in parallel with the 
original coefficients will however not work for the formally similar case of 
Z[x]/{2x - 1). Whereas a map that for all n ^ Z takes 2nx to n and 
(2n + l)x to n + X2 makes sense as a map (and indeed is what one would 
arrive at in the standard bases formalism), it cannot serve as a reduction 
in the diamond lemma formalism because it is not a group homomorphism; 
X2 + X2 = ^ 1. What will work is instead to pick Ai = Z,[^][x], where 
Z[^] should be regarded as the subring of Q generated by ^. The main 
justification for introducing such a multiplicative inverse of 2 would be the 
defining identity 2x = 1 itself — whose interpretation must be that x is 
precisely such an inverse — and once ^ is available the rest is trivial. 

The common idea generalising both cases appears to be that if one wants 
to make a reduction mapping rfi to a and r is neither invertible nor a zero 
divisor, then one should extend A4 with a new element //' such that rfi' = a 
and have the reduction map ^ to n' . (The 'new' is important here, because 
Z[x,y]/{2x - 2y) ^ Z[?/] CZ2[C,y] where ^ = y - x'; that 2x' = 2y but 
x' ^ y since x' is new is what creates the characteristic 2 part.) Whether this 
method can be turned into an algorithm (as has been done for Grobner bases 
over e.g. euclidian domains) is at the time of writing unclear — automating 
this kind of modifications to the base group M seems highly nontrivial — 
but it should illustrate the usefulness of not having Theorem 15.111 restricted 
to the case of M being a free module, even though that assumption simplifies 
the results in this section quite considerably. 
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8.2 Commutative, associative, and nonassociative algebras 

Many forms of the fundamental theorem of Grobner bases turn out to be 
special cases, with particular choices of M. and V, of Theorem 18.41 and 
therefore follow from it as mere corollaries. The most classical is that for 
commutative polynomials over a field. 

Corollary 8.5 (Buchberger). Let IZ he a field, let X he a set, let M = 
TZ[X], and let y he the set of monomials (power products) in M.. Let V = 
{h I— > fib}fj,£y (a set of maps M. — > M). Let P he a well-founded semigroup 
total order on y. Define a map Z: {y X M) — >Mhy 

ryi, N / lcm(/ii,^2) lcm(^i,/X2) 

Zf(/ii,ai), (/i2,a2)) = fli 0.2 (8.1) 

where lcm(/Ui,/X2) denotes the least common multiple of [i\ and \i2- 

Let 5 C 3^ X he such that a € DSM(;U, P) for every {fi, a) £ S and let 



Ti{S) he as in Construction 7.2. Then the following are equivalent: 



1. {fi — a}(^,a)e5 P-monic V -Grohner hasis of Z{S). 

2. For every pair (si,S2) € 5^ there exists some t € T{S) such that 
t{ZisuS2)) =0. 

3. For every si = (/xi, ai) € S and S2 = {fJ,2, 02) S "S* such that fii and ^2 
are not coprime there exists some t € T{S) such that t(Z(si, S2)) = 0. 

Proof. This is mostly the equivalence of (d) and (a'") from Theorem 18. 41 but 
there are minor variations so it doesn't hurt to make the chain of implications 
explicit. 

The first claim is exactly (d), so this is equivalent to Red(5') = Ai. 
Let si = (/ii,ai) € 5 and §2 = ilJ'2,0,2) £ 5 be arbitrary. Let = 
lcm(/xi,/i2)//Ui and 1/2 = lcm(//i, ^2)7^*2 • Then Z{si,S2) = I'lai - 1^202 = 
vi{ai — Hi) — 1^2(02 — 1^2) £ and hence t^ (^Z[si, 82)) = 0. Since every 
value of t^ is attained by some reduction, this has shown that the first claim 
implies the second, and it is trivial that the second implies the third. 

It only remains to show that the third claim is in fact (a'"). To that 
end, let (f^^^su A*, i?;2,s2) be a ^/-critical ambiguity of Ti(S'). Let {^1,01) = Si 
and Vi = Vi{l) for i = 1,2. Then uifii = /i = 1^2/^2 and hence lcm(/xi,;U2) 
divides //. However if k := /x/lcm(/ii, /X2) 7^ 1 then (t^^^sj, /Li, t^2,s2) would be 
a proper ^/-shadow of (t^i/K,si! A'/'^) ^d2/k,s2)' which by criticality is not the 
case. Similarly gcd(;Ui,//2) 7^ 1 since one would otherwise have z^i = fj.2 and 
'^2 = fJ'i, in which case /i, ^1)2,52) would be a montage with composition 

map w{bi,b2) = &i62- Finally Z{si,S2) = viai - z^2«2 = - it.2,s2(/")- 

□ 
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Another applied specialisation of Theorem 18.41 would be to take y to be 
a monoid on the form X' x where X* denotes the free abelian monoid 
generated by Xi. This can be used to formally justify Grobner basis calcu- 
lations in TZ{X2) where the given relations contain some set Xi of commu- 
tative coefficients for which one doesn't want to fix the values, by making 
the calculations in TZ[Xi]{X2) instead. 

There is of course always the possibility to work in TZ{Xi UX2) and add 
relations to make elements of Xi commute with everything else, but that can 
get unintuitive and impractical (especially if Xi is large compared to X2). 
Another possibility would be to make a transcendental field extension of TZ 
with the variables in Xi, but that would then make it formally questionable 
to specialise to where the coefficients satisfy some algebraic relation. 

Corollary 8.6. Let TZ be an associative and commutative ring with unit, let 
Xi and X2 be disjoint sets, let A4 = TZ[Xi]{X2) , and let y be the monoid in 
A4 which is generated by Xi U X2. Write X* for the abelian submonoid of 
y which is generated by Xi alone. Let V = {b ^ Kubv} ^i,=x' ,^,uex* (0. set 
of maps M — > M ). Let P be a well-founded semigroup partial order on y . 

Let S (^y ^ M. be such that a G DSM(/i, P) for every (^u, a) € S" and let 
Ti{S) be as in Construction \ 7. 4 Then the three claims that Red(S') = ^A, 
M. = Irr(5) ©X(5), and {/x — a}(^,a)e5 ^-^ 0, P-monic V -Grobner basis of 
T[S) are each equivalent to the conjunction of the following two conditions: 

• For every octuplet ((/ii,ai), (/i2, a2),ri,r2,r^, i^i,i'2, i^s) ^ S'^x{X')^x 

such that /xi = rir2i^ii^2, = r2r3U2i'3, 1/1,1/2,^3 / 1, and 
gcd(ri,r2) = gcd(r2,r3) = gcd(ri,r3) = 1, there exists some t E T{S) 
such that t(r3aiz/3 — riuia2) = 0. 

• For every octuplet ((/ii,ai), (^2, 02), ri, r2, r3, vi,V2, 1/3) G S'^x(X*)^x 
{X2Y such that fii = rir2 1/11/21/3, fJ-2 = r2r^V2, (/^i,ai) / (/U2,a2), and 
gcd(ri,r2) = gcd(r2,r3) = gcd(ri,r3) = 1, there exists some t € T{S) 
such that t(r3ai — riz/i 021/3) = 0. 

Proof sketch. Same overall structure as in the proof of Corollary 18.51 only 
the identification of ^/-critical ambiguities needs to be revised. This splits 
into a noncommutative part for X| which is the same as in Example 16.101 
and a commutative part for X* which is the same as in Corollary [831 D 

Corollary 8.7 (Gerritzen [6J). Let TZ be a field, let X be a set, let y be 
the free magma Mag(X) on X , and let Ai be the free TZ-module with basis 
y. Extend the multiplication on y to M by bilinearity, so that Ai is the 
(nonunital) free nonassociative TZ-algebra TZ{X} on X. Let Vi be the set of 
all maps M — > M : b ^ vb and M — > M : b ^ bv for u £ y. Let V be 
the monoid (with composition as operation) generated by V. 
Let P be a well-founded total order on y such that 

X < fi in P =^ Xv < /xz/ in P and vX < i//i in P (8.2) 
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for all XjfijU € y. Let S* C 3^ x be such that a G DSM(/i, P) for every 
{^Ji,a) € S and let Ti{S) he as in Construction 1.2. Then the following are 
equivalent: 



1. {/i — a}(^,j')g5 is a P-monic V -Grohner basis ofI{S). 

2. M =lvv{S)®l{S). 

3. For all (/Ui, ai), (/i2, «'2) £ S and v such that fii = v{pL2) there 
exists some t € T(S) such that t(ai — ^(02)) = 0. 

Proof. It follows from (j8.2p that all elements of V are strictly monotone with 
respect to P. Hence the conditions in Theorem [831 are fulfilled and one only 
has to verify that the last condition is (a'") by characterising the ^/-critical 
ambiguities. 

An arbitrary ambiguity of Ti{S) has the form (ii,i,(^i,ai), *«;2,(A'2,a2)) 
where vi{fii) = /i = V2(/^2)- The situation in the last condition is exactly 
this for vi = id or V2 = id, so it only remains to show that all other ambi- 
guities are non-y-critical. Unique factorisation in y gives rise to a unique 
factorisation in V (as compositions of elements of Vi), and thus there exist 
v[,V2 € Vi and v'{, € V such that vi = v[ o v'^ and f 2 = ^2 ° '^2- 

If v[ = V2 then (^',(^i,ai),^^i(Ati),^',(/x2,a2)) is another ambiguity, of 
which (i«;i,(^i,ai)) A*) ^i)2,(Ai2,a2)) is ^ proper ^/-shadow. Since there for every 
H (z y is only finitely many (v, z^) £ V x y such that = v^v), it follows 
that y-shadow-critical is the same as IZ-shadow-minimal, and hence none 
of the ambiguities with v[ = v'2 are ^-critical. 

If instead v'l 7^ V2 then one of these must multiply on the left and the 
other must multiply on the right; it can without loss of generality be assumed 
that Vi{h) = uih and V2(h) = bu2. This implies that fj, = viV2 = ^^2 (/^2)^'i (mi) 
however, and thus (it,i,(^i,ai)> ^, *i'2,(M2,a2)) is a montage of {y2,t^'i,{^,^,a^)) and 
(1^1, tj,// (^2,02)) with composition map -^(61, 62) = 6261. Hence the ambigui- 
ties with v'^ 7^ V2 aren't ^-critical either. □ 

8.3 Path algebras 

There is in the literature also a theorem by Farkas, Feustel, and Green [5] 
which similarly characterises (reduced) Grobner bases in path algebras and 
certain semigroup algebras; the result is derived in an axiomatic setting gen- 
eralising path algebras. Not surprisingly, it is in that setting equally possible 
to derive from the generic diamond lemma theory a result on more general 
(uniform monic) Grobner bases in these algebras. The proof is essentially 
the same as for Theorem 18. 4i but the result is not Yet Another Corollary 
due to some technicalities caused by allowing the product of two monomials 
to be zero. 

In the present notation, one is given a field TZ and an associative TZ- 
algebra M. with basis 3^. This basis is assumed to be well-ordered, so let P 
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be that order. Another binary relation divides, or symbolically |, is defined 
on 3^ by ;U I A iff there exist 1^1,1^2 ^ y such that A = z^i^z^2- These data are 
furthermore required to satisfy five axioms: 

Ml. 3^U{0}cA4isa semigroup under multiplication. 

M2. 'Divides' is reflexive. 

M3. For each A € 3^, the set { ^ £ 3^ | divides A } is finite. 
M4. If fj,,^, X, p y are such that none of the products below are zero, then 
u < pin P =^ Xup < Xpp in P. (8.3) 

M5. If ^ I A then ps^XinP. 

In the case that M. is the path algebra 7^(r) and 3^ is the set of all path^ 
in r (counting vertices as paths of length 0), axioms M1-M3 are trivial 
properties; in particular Ml is characteristic. M4 is a natural modification 
of the monoid partial order axiom (j6.5p and M5 is another condition on P; 
the authors suggest that one meets it by using a length-lexicographic order, 
although a weighted-degree lexicographic order will work just as well. It 
should be observed that y U {0} is typically not a monoid, since the unit in 
a path algebra is the sum of all length paths rather than any particular 
path. 

Simple reductions may be constructed as in Construction 17. 2( with V 
being the set of maps b 1— > Xhp for A, p E 3^; this is exactly the same as in [Sj 
p. 731]. Similarly the definition there of a (P-monic) 'Grobner generating 
set' is exactly the same as 'P-monic ^-Grobner basis' here. Axiom M4 is 
exactly what is needed in Lemma 16.51 to establish that V correlates P to 
itself, and then the compatibility with P of T{S) follows from Lemma [7.4l for 
any S constructed as in Lemma 18.31 It is however not quite as straightfor- 
ward to apply Lemma 17.31 to prove that the elements of V are advanceable. 
Besides the trivial detail that V is not in general closed under composition 
— if vi{h) = Xibpi and ^2(6) = A26/92 then {vi ov2){b) = A1A26/O2P1 which 
is only an element of y if A1A2 7^ and P2P1 7^ 0, although that can be 
worked around by considering 1/ U {0} instead — there is in this lemma also 
the more significant condition that every (p, a) (z S and v €V must satisfy 
v{a) = if v{p) = 0. This is why the result was above described as being 
about uniform monic Grobner bases. 

In [H p. 733], two elements PjU y are defined to be uniform-equivalent 

if 

Xpp = <^ Xup = for all X,pey. (8.4) 

^ To be formally correct, one should really say walk rather than 'path', since a path 
(as all graph theorists know) may not have any repeated vertices, but speaking of 'walk 
algebras' here would probably cause more confusion than it avoids. 
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In a path algebra, this simply means that /i and v have the same end- 
points, but in principle the matter might be more complicated. Nonetheless, 
uniform-equivalence is an equivalence relation on y and defines a partition 
of 3^ into equivalence classes. An element a of is said to be uniform 
if all elements of supp(a) are uniform-equivalent, and consequently a pair 
(/i, a) € 3^ X can be said to be uniform if every element of supp(a) 
is uniform-equivalent to ^. Considering only uniform Grobner bases may 
seem like a severe restriction, but at least in the case of a path algebra it 
is actually rather trivial. The reason for this is that there is in a path al- 
gebra no way in which a path can be "uniform-superior" to another path; 
they're either equivalent or quite different. More concretely, if /i, G 3^ are 
not uniform-equivalent then for each u € at most one of f and v{y^ 
can be nonzero. This has the effect that only the uniform parts of rules get 
encoded into Ti{S); for ti,,(;x,a) to even exist v{fi) must be nonzero and thus 
all £ supp(a) which are not uniform-equivalent to fi will be killed by v. 

In a path algebra, it is easy to see that any ideal is generated by a 
set of uniform elements; writing Tq for the set of vertices in T, any a S 
7^(r) can be expressed as the sum of uniform elements YIk peVo ^^P^ 
these terms are elements of every ideal containing a. That the same should 
hold in general is not obvious, but any algebra satisfying M1~M5 must 
contain idempotent elements which fill the role of vertices in this argument; 
in particular axiom M2 is not as innocent as it may seem, since what it 
claims is really that there for every // G 3^ exist k, p £ y such that Kfip = fi. 
The structure of algebras satisfying M1-M5 is the subject of [5, Sec. 4], and 
the conclusion is roughly that any such algebra has to be a path algebra in 
which some paths have been identified. 

Anyhow, with Ai, TZ, y, V, P, S, and Ti{S) as above, it follows that 
(a), (a'), (a"), (a'"), (b), (c), and (d) of Theorem 18.41 are equivalent. (When 
employing Lemma 18.31 one must extend V with the identity map to make 
it a monoid, but since S is uniform that doesn't contribute any additional 
reductions.) The structure of F-critical ambiguities can be analysed as in 
Example l6.10t [5] gives the characterisation of overlaps between {^i, ai) and 
{H2, 02) as being determined by 1^1, ^2, A € 3^ such that fii = viX, 112 = Xi^2, 
i>2 / /^2, and V2 / P2- 
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